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Preface 


The course presented in this text concentrates on the typical methods of 
modern set theory: transfinite induction, Zorn’s lemma, the continuum hy- 
pothesis, Martin’s axiom, the diamond principle <0', and elements of forcing. 
The choice of the topics and the way in which they are presented is subordi- 
nate to one purpose - to get the tools that are most useful in applications, 
especially in abstract geometry, analysis, topology, and algebra. In par- 
ticular, most of the methods presented in this course are accompanied by 
many applications in abstract geometry, real analysis, and, in a few cases, 
topology and algebra. Thus the text is dedicated to all readers that would 
like to apply set-theoretic methods outside set theory. 

The course is presented as a textbook that is appropriate for either a 
lower-level graduate course or an advanced undergraduate course. How- 
ever, the potential readership should also include mathematicians whose 
expertise lies outside set theory but who would like to learn more about 
modern set-theoretic techniques that might be applicable in their field. 

The reader of this text is assumed to have a good understanding of 
abstract proving techniques, and of the basic geometric and topological 
structure of the n-dimensional Euclidean space R". In particular, a com- 
fort in dealing with the continuous functions from 1" into R is assumed. A 
basic set-theoretic knowledge is also required. This includes a good under- 
standing of the basic set operations (union, intersection, Cartesian product 
of arbitrary families of sets, and difference of two sets), abstract functions 
(the operations of taking images and preimages of sets with respect to func- 
tions), and elements of the theory of cardinal numbers (finite, countable, 
and uncountable sets.) Most of this knowledge is included in any course in 
analysis, topology, or algebra. These prerequisites are also discussed briefly 
in Part I of the text. 

The book is organized as follows. Part I introduces the reader to ax- 
iomatic set theory and uses it to develop basic set-theoretic concepts. In 
particular, Chapter 1 contains the necessary background in logic, discusses 
the most fundamental axioms of ZFC, and uses them to define basic set- 
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theoretic operations. In Chapter 2 the notions of relation, function, and 
Cartesian product are defined within the framework of ZFC theory. Related 
notions are also introduced and their fundamental properties are discussed. 
Chapter 3 describes the set-theoretic interpretation of the sets of natural 
numbers, integers, and real numbers. Most of the facts presented in Part I 
are left without proof. 

Part II deals with the fundamental concepts of “classical set theory.” 
The ordinal and cardinal numbers are introduced and their arithmetic is 
developed. A theorem on definition by recursion is proved and used to 
prove Zorn’s lemma. Section 4.4 contains some standard applications of 
Zorn’s lemma in analysis, topology, and algebra. 

Part III is designed to familiarize the reader with proofs by transfi- 
nite induction. In particular, Section 6.1 illustrates a typical transfinite- 
induction construction and the diagonalization argument by describing 
several different constructions of subsets of R n with strange geometric prop- 
erties. The two remaining sections of Chapter 6 introduce the basic ele- 
ments of descriptive set theory and discuss Borel and Lebesgue-measurable 
sets and sets with the Baire property. Chapter 7 is designed to help the 
reader master the recursive-definitions technique. Most of the examples 
presented there concern real functions, and in many cases consist of the 
newest research results in this area. 

Part IV is designed to introduce the tools of “modern set theory” : Mar- 
tin’s axiom, the diamond principle <0', and the forcing method. The overall 
idea behind their presentation is to introduce them as natural refinements 
of the method of transfinite induction. Thus, based on the solid founda- 
tion built in Part III, the forcing notions and forcing arguments presented 
there are obtained as “transformed” transfinite-induction arguments. In 
particular, the more standard axiomatic approach to these methods is de- 
scribed in Chapter 8, where Martin’s axiom and the diamond principle are 
introduced and discussed. Chapter 9 is the most advanced part of this text 
and describes the forcing method. Section 9.1 consists of some additional 
prerequisites, mainly logical, necessary to follow the other sections. In Sec- 
tion 9.2 the main theoretical basis for the forcing theory is introduced while 
proving the consistency of ZFC and the negation of CH. In Section 9.3 is 
constructed a generic model for ZFC+<> (thus, also for ZFC+CH). Sec- 
tion 9.4 discusses the product lemma and uses it to deduce a few more 
properties of the Cohen model, that is, the model from Section 9.2. The 
book finishes with Section 9.5 in which is proved the simultaneous con- 
sistency of Martin’s axiom and the negation of the continuum hypothesis. 
This proof, done by iterated forcing, shows that even in the world of the 
“sophisticated recursion method” of forcing the “classical” recursion tech- 
nique is still a fundamental method of set theory - the desired model is 
obtained by constructing forcing extensions by transfinite induction. 
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It is also worthwhile to point out here that readers with different back- 
grounds will certainly be interested in different parts of this text. Most 
advanced graduate students as well as mathematical researchers using this 
book will almost certainly just skim Part I. The same may be also true 
for some of these readers for at least some portion of Part II. Part III and 
the first chapter of Part IV should be considered as the core of this text 
and are written for the widest group of readers. Finally, the last chapter 
(concerning forcing) is the most difficult and logic oriented, and will prob- 
ably be of interest only to the most dedicated readers. It certainly can be 
excluded from any undergraduate course based on this text. 
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Axiomatic set theory 


1.1 Why axiomatic set theory? 

Essentially all mathematical theories deal with sets in one way or another. 
In most cases, however, the use of set theory is limited to its basics: el- 
ementary operations on sets, fundamental facts about functions, and, in 
some cases, rudimentary elements of cardinal arithmetic. This basic part 
of set theory is very intuitive and can be developed using only our “good” 
intuition for what sets are. The theory of sets developed in that way is 
called “naive” set theory, as opposed to “axiomatic” set theory, where all 
properties of sets are deduced from a fixed set of axioms. 

Clearly the “naive” approach is very appealing. It allows us to prove a 
lot of facts on sets in a quick and convincing way. Also, this was the way 
the first mathematicians studied sets, including Georg Cantor, a “father 
of set theory.” However, modern set theory departed from the “paradise” 
of this simple-minded approach, replacing it with “axiomatic set theory,” 
the highly structured form of set theory. What was the reason for such a 
replacement? 

Intuitively, a set is any collection of all elements that satisfy a certain 
given property. Thus, the following axiom schema of comprehension, 
due to Frege (1893), seems to be very intuitive. 


If ip is a property, then there exists a set Y = {X: <p(X)} of all 
elements having property <p. 


This principle, however, is false! It follows from the following theorem of 
Russell (1903) known as Russell’s antinomy or Russell’s paradox. 
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Russell’s paradox There is no set S = {X : X c/L X}. 

The axiom schema of comprehension fails for the formula <p(X) defined 
as “X ^ X.” To see it, notice that if S had been a set we would have had 
for every Y 

Y gS^Y £Y. 

Substituting S for Y we obtain 

SeS^ S #S, 
which evidently is impossible. 

This paradox, and other similar to it, convinced mathematicians that 
we cannot rely on our intuition when dealing with abstract objects such as 
arbitrary sets. To avoid this trouble, “naive” set theory has been replaced 
with axiomatic set theory. 

The task of finding one “universal” axiomatic system for set theory 
that agrees with our intuition and is free of paradoxes was not easy, and 
was not without some disagreement. Some of the disagreement still ex- 
ists today. However, after almost a century of discussions, the set of ten 
axioms/schemas, known as the Zermelo-Fraenkel axioms (abbreviated as 
ZFC, where C stands for the axiom of choice), has been chosen as the most 
natural. These axioms will be introduced and explained in the next chap- 
ters. The full list of these axioms with some comments is also included in 
Appendix A. 

It should be pointed out here that the ZFC axioms are far from “per- 
fect.” It could be expected that a “perfect” set of axioms should be com- 
plete, that is, that for any statement <p expressed in the language of set 
theory (which is described in the next section) either ip or its negation is a 
consequence of the axioms. Also, a “good” set of axioms should certainly 
be consistent, that is, should not lead to a contradiction. Unfortunately, we 
cannot prove either of these properties for the ZFC axioms. More precisely, 
we do believe that the ZFC axioms are consistent. However, if this belief 
is correct, we can’t prove it using the ZFC axioms alone. Does it mean 
that we should search for a better system of set-theory axioms that would 
be without such a deficiency? Unfortunately, there is no use in searching 
for it, since no “reasonable” set of axioms of set theory can prove its own 
consistency. This follows from the following celebrated theorem of Godel. 

Theorem 1.1.1 (Godel’s second incompleteness theorem) Let T be a set 
of axioms expressed in a formal language C (such as the language of set 
theory described in Section 1.2) and assume that T has the following “rea- 
sonable'’ properties. 


(1) T is consistent. 
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(2) There is an effective algorithm that decides for an arbitrary sentence 
of the language C whether it is in T or not. 

(3) T is complicated enough to encode simple arithmetic of the natural 
numbers. 

Then there is a sentence p of the language C that encodes the statement 
“T is consistent. ” However, tp is not a consequence of the axioms T. 

In other words, Theorem 1.1.1 shows us that for whatever “reasonable” 
systems of axioms of set theory we choose, we will always have to rely on 
our intuition for its consistency. Thus, the ZFC axioms are as good (or 
bad) in this aspect as any other “reasonable” system of axioms. 

So what about the completeness of the ZFC axioms? Can we prove 
at least that much? The answer is again negative and once again it is a 
common property for all “reasonable” systems of axioms, as follows from 
another theorem of Godel. 

Theorem 1.1.2 (Godel’s first incompleteness theorem) Let T be a set of 
axioms expressed in a formal language C ( such as the language of set theory 
described in Section 1.2) and assume that.T has the following “ reasonable ” 
properties. 

(1) T is consistent. 

(2) There is an effective algorithm that decides for an arbitrary sentence 
of the language C whether it is in T or not. 

(3) T is complicated enough to encode simple arithmetic of the natural 
numbers. 

Then there is a sentence p of the language C such that neither p nor its 
negation ->p can be deduced from the axioms T. 

A sentence <p as in Theorem 1.1.2 is said to be independent of the 
axioms T. It is not difficult to prove that a sentence tp is independent of 
the consistent set of axioms T if and only if both TU{p} and T U {-></?} are 
consistent too. Part of this course will be devoted to studying the sentences 
of set theory that are independent of the ZFC axioms. 

The preceding discussion shows that there is no way to find a good 
complete set of axioms for set theory. On the other hand, we can find a 
set of axioms that reach far enough to allow encoding of all set-theoretic 
operations and all classical mathematical structures. Indeed, the ZFC ax- 
ioms do satisfy this requirement, and the rest of Part I will be devoted to 
describing such encodings of all structures of interest. 
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1.2 The language and the basic axioms 

Any mathematical theory must begin with undefined concepts. In the case 
of set theory these concepts are the notion of a “set” and the relation “is 
an element of” between the sets. In particular, we write “ x £ y” for u x is 
an element of y.” 

The relation £ is primitive for set theory, that is, we do not define it. 
All other objects, including the notion of a set, are described by the axioms. 
In particular, all objects considered in formal set theory are sets. (Thus, 
the word “set” is superfluous.) 

In order to talk about any formal set theory it is necessary to specify 
first the language that we will use and in which we will express the axioms. 
The correct expressions in our language are called formulas. The basic for- 
mulas are “x £ y” and “z = t,” where x, y, z, and t (or some other variable 
symbols) stand for the sets. We can combine these expressions using the 
basic logical connectors of negation conjunction &, disjunction V, impli- 
cation — >, and equivalence <->. Thus, for example, —up means “not <p” and 
p—*ip stands for “ip implies ip.” In addition, we will use two quantifiers: 
existential 3 and universal V. Thus, an expression \/xp is interpreted as 
“for all x formula p holds.” In addition, the parentheses “(” and “)” are 
used, when appropriate. 

Formally, the formulas can be created only as just described. However, 
for convenience, we will also often use some shortcuts. For example, an ex- 
pression 3x £ Ap{x) will be used as an abbreviation for 3x(x £ A & p(x)), 
and we will write Vx £ Ap(x) to abbreviate the formula \/x(x £ A—*tp{x)). 
Also we will use the shortcuts x y, x ^ y, x C y, and x <£_ y, where, for 
example, x C y stands for \/z(z £ x—>z £ y). 

Finally, only variables, the relations = and £, and logical symbols al- 
ready mentioned are allowed in formal formulas. However, we will often 
use some other constants. For example, we will write x = 0 (x is an empty 
set) in place of -G y(y £ x). 

We will discuss ZFC axioms throughout the next few sections as they 
are needed. Also, in most cases, we won’t write in the main text the 
formulas representing the axioms. However, the full list of ZFC axioms 
together with the formulas can be found in Appendix A. 

Let us start with the two most basic axioms. 

Set existence axiom There exists a set: 3x{x = x). 

Extensionality axiom If x and y have the same elements, then x is equal 
to y. 

The set existence axiom follows from the others. However, it is the 
most basic of all the axioms, since it ensures that set theory is not a trivial 
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theory. The extensionality axiom tells us that the sets can be distinguish 
only by their elements. 

Comprehension scheme (or schema of separation) For every for- 
mula p(s, t) with free variables s and t, set x, and parameter p there 
exists a set y = {u € x: <p(u,p)} that contains all those u £ x that 
have the property p. 

Notice that the comprehension scheme is, in fact, the scheme for in- 
finitely many axioms, one for each formula ip. It is a weaker version 
of Frege’s axiom schema of comprehension. However, the contradiction 
of Russell’s paradox can be avoided, since the elements of the new set 
y = {u £ x : c p(u,p )} are chosen from a fixed set x, rather than from an 
undefined object such as “the class of all sets.” 

From the set existence axiom and the comprehension scheme used with 
the formula “tt u,” we can conclude the following stronger version of the 
set existence axiom. 

Empty set axiom There exists the empty set 0. 

To see the implication, simply define 0 = {y £ x: y yf y}, where a; is a 
set from the set existence axiom. Notice that by the extensionality axiom 
the empty set is unique. 

An interesting consequence of the comprehension scheme axiom is the 
following theorem. 

Theorem 1.2.1 There is no set of all sets. 

Proof If there were a set S of all sets then the following set 

Z = {X £ S: X (jL X} 

would exist by the comprehension scheme axiom. However, with S being 
the set of all sets, we would have that Z = { X : X ^ A}, the set from 
Russell’s paradox. This contradiction shows that the set S of all sets cannot 
exist. □ 

By the previous theorem all sets do not form a set. However, we some- 
times like to talk about this object. In such a case we will talk about a 
class of sets or the set-theoretic universe. We will talk about classes only 
on an intuitive level. It is worth mentioning, however, that the theory of 
classes can also be formalized similarly to the theory of sets. This, however, 
is far beyond the scope of this course. Let us mention only that there are 
other proper classes of sets (i.e., classes that are not sets) that are strictly 
smaller than the class of all sets. 
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The comprehension scheme axiom is a conditional existence axiom, that 
is, it describes how to obtain a set (subset) from another set. Other basic 
conditional existence axioms are listed here. 

Pairing axiom For any a and b there exists a set x that contains a and 

b. 

Union axiom For every family F there exists a set U containing the union 
(J F of all elements of F . 

Power set axiom For every set X there exists a set P containing the set 
V{X) (the power set) of all subsets of X. 

In particular, the pairing axiom states that for any a and b there exists 
a set x such that {a, 6} C x. Although it does not state directly that there 
exists a set {a, 6}, the existence of this set can easily be concluded from 
the existence of x and the comprehension scheme axiom: 

{a, b} = {u £ x: u = a V u = b}. 

Similarly, we can conclude from the union and power set axioms that for 
every sets F and X there exist the union of F 

\Jp= {x: 3F £ F {x £ F)} = {x £ U : 3F £ F {x £ F)} 

and the power set of X 

P(X) = {z: zc X} = {z£P: zc X}. 

It is also easy to see that these sets are defined uniquely. Notice also that 
the existence of a set {a, b} implies the existence of a singleton set {a}, 
since {a} = {a, a}. 

The other basic operations on sets can be defined as follows: the union 
of two sets x and y by 

xUy = (J{ar, y}; 
the difference of sets x and y by 

x\ y = {z £ x: z qL y}-, 
the arbitrary intersections of a family F by 

[^\F = [z £[jF:^F £ F (z £ F)]- 
and the intersections of sets x and y by 

xt3y = f"){x, 2 /}. 
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The existence of sets x\y and P| T follows from the axiom of comprehension. 

We will also sometimes use the operation of symmetric difference of two 
sets, defined by 

xA y = (x\y) U (y\x). 

Its basic properties are listed in the next theorem. Its proof is left as an 
exercise. 

Theorem 1.2.2 For every x,y, and z 

(a) xAy = yAx, 

(b) xAy = (x U y) \ (x (~l y ) , 

(c) (xAy)Az = xA(yAz). 

We will define an ordered pair (a, b) for arbitrary a and b by 

(a, 6) = {{a}, {a, 6}}. (1.1) 

It is difficult to claim that this definition is natural. However, it is com- 
monly accepted in modern set theory, and the next theorem justifies it by 
showing that it maintains the intuitive properties we usually associate with 
the ordered pair. 

Theorem 1.2.3 For arbitrary a,b,c, and d 

(a, b ) = (c, d) if and only if a = c and b = d. 

Proof Implication -4= is obvious. 

To see the other implication, assume that (a, b) = ( c, d ). This means 
that {{a}, {a, 6}} = {{c}, {c, d}}. In particular, by the axiom of extension- 
ality, {a} is equal to either {c} or {c, d}. 

If {a} = {c} then a = c. If {a} = {c, d}, then c must belong to {a} and 
we also conclude that a = c. In any case, a = c and we can deduce that 
{{a}, {a, b}} = {{a}, {a, d}}. We wish to show that this implies b = d. 

But {a, b} belongs to {{a}, {a, d}}. Thus we have two cases. 

Case 1: {a, b} = {a, d}. Then b = a or b = d. If b = d we are done. If 
b = a then {a, b} = {a} and so {a, d} = {a}. But d belongs then to {a} 
and so d = a. Since we had also a = b we conclude b = d. 

Case 2: {a, b} = {a}. Then b belongs to {a} and so b = a. Hence we 
conclude that {a, d} = {a}, and as in case 1 we can conclude that b = d. 
□ 
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Now we can define an ordered triple (a, b, c ) by identifying it with 
{(a,b),c) and, in general, an ordered n-tuple by 

(cti , $2 j • • * ; On— 1 ? On) — ( (ui , ; • • • j O n —\) , tt n ) . 

The agreement of this definition with our intuition is given by the following 
theorem, presented without proof. 

Theorem 1.2.4 (ai,a 2 ,... , a„_i, a n ) = (a\, a' 2 , ■ ■ ■ ,a' n _ 1 ,a' n ) if and only 
If °>i = a'i for all i = 1,2,... ,n. 

Next we will define a Cartesian product X x Y as the set of all ordered 
pairs ( x , y) such that x G X and y G Y. To make this definition formal, 
we have to use the comprehension axiom. For this, notice that for every 
x € X and y GY we have 

(x,y) = {{x},{x,y}} GP(V(X UY)). 

Hence, we can define 

X x Y = {z G V(V(X U V)) : 3x G X 3y G Y (z = (x, y))j. (1.2) 

The basic properties of the Cartesian product and its relation to other 
set-theoretic operations are described in the exercises. 

The last axiom we would like to discuss in this section is the infinity 
axiom. It states that there exists at least one infinite set. This is the 
only axiom that implies the existence of an infinite object. Without it, 
the family T of all finite subsets of the set of natural numbers would be a 
good “model” of set theory, that is, T satisfies all the axioms of set theory 
except the infinity axiom. 

To make the statements of the infinity axiom more readable we intro- 
duce the following abbreviation. We say that y is a successor of x and write 
y = S(x) if y = x U {x}, that is, when 

V z[z G y <-> (z G x V 2 = x)\. 

Infinity axiom (Zermelo 1908) There exists an infinite set (of some spe- 
cial form): 

3x \iz{z = 0— >2 G x) & Vy G xVz(z = S(y)—rz G x)] . 


Notice that the infinity axiom obviously implies the set existence axiom. 
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EXERCISES 

1 Prove that if F € T then P| T C F C (J IF. 

2 Show that for every family T and every set A 

(a) if A C F for every F £ T then A C f] T , and 

(b) if F C A for every F € T then 1J T C A. 

3 Prove that if T fl Q 0 then f'l.F fl H 0 C fl {F C\Q). Give examples 
showing that the inclusion cannot be replaced by equality and that the 
assumption T fl Q ^ 0 is essential. 

4 Prove Theorem 1.2.2. 

5 Show that (( a,b),c ) = ((a',6'),c / ) if and only if (a, (b,c)) = (a', (6 , ,c / )) 
if and only if a = a', b = b' , and c = e' . Conclude that we could define an 
ordered triple ( a,b,c } as (a, (b,c)) instead of (( a,b),c ). 

6 Prove that X x Y = 0 if and only if X = 0 or Y = 0. 

7 Show that for arbitrary sets X,Y, and Z the following holds. 

(a) (XUY) x Z = (X x Z)L)(Y x Z). 

(b) (X HY) x Z = (X x Z)n(Y x Z). 

(c) (X\Y)x Z = (X x Z)\(Y x Z). 

8 Prove that ii X x Z C Y x T and IxZ/0 then X C Y and Z C T. 
Give an example showing that the assumption IxZ/0 is essential. 



Chapter 2 


Relations, functions, and 
Cartesian product 

2.1 Relations and the axiom of choice 

A subset R of a Cartesian product IxFis called a (binary) relation. 

For a relation R of a Cartesian product X x Y, we usually write aRb 
instead of (a, b) £ R and read: a is in the relation R to 6, or the relation 
R between a and b holds. 

The domain dom(i?) of a relation R is defined as the set of all x such 
that ( x , y) € R for some y £Y, that is, 

clom(i?) = {x £ X : 3y £ Y ((x, y ) £ R)}\ 

the range range(-R) of a relation R is defined as the set of all y such that 
(x, y) £ R for some x £ X, that is, 

range(-R) = {y £ Y : 3x £ X ((x,y) £ R)}. 


The set Z = dom(i?) U range(i?) for relation R is called a field of R. 
Notice that R C Z x Z . In this case we often say that R is defined on Z. 

Examples 1. The relation f?< on the set of real numbers R. defined as 
(x,y) £ R< if and only if x < y is usually denoted by <. 1 Notice that 

1 In the examples we will often use notions that you supposedly know from other courses, 
even if we have not yet defined them within the framework of set theory (such as the set 
of real numbers R in this example). This will be used only to help you develop the right 
intuition. We will try to avoid this kind of situation in the main stream of the course. 
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R < is the subset of the plane consisting of those points that are above the 
identity line y = x. The domain and the range of this relation are equal 
to R. 

2. Consider the relation on the set of natural numbers N defined as 
(x,y) £ Rdiv if and only if x and y are different natural numbers greater 
than 1 and x divides y. Then dom(i? c j; v ) = {2, 3,4, ... } and range(i?di v ) is 
the set of all composite natural numbers. 

Let R C X x Y and S C Y x Z. The relation 
{(y,x) £ Y x X : xRy} 

is called the inverse of R and is denoted by R~ 1 . The relation 

{(x, z) £ X x Z : By £ Y ( xRy & ySz)} 

is called the composition of R and S and is denoted by S o R. 

Note that (i?” 1 ) _1 = R, dom(i? _1 ) = range(-R), range(i? _1 ) = dom(i?), 
dom(S'oi?) c dom(iZ), and range(5oi?) C range (S’). Moreover, (S'oi?,) -1 = 
i? _1 o S' -1 . 

Examples 1. If < and > are defined on R in the natural way then 
(<)- 1 = >- 

2. (>) o (<) is equal to the relation R x R. 

3- (<) o (<) is equal to <. 

Let R be a binary relation on X x X. We say that R is 
reflexive if xRx for every x £ X; 

symmetric if xRy implies yRx for every x, y £ X; and 
transitive if xRy and yRz imply xRz for every x,y,z € X. 


Examples 1. The relations < and > on R are transitive, but they are 
neither reflexive nor symmetric. 

2. The relations < and > on R are transitive and reflexive, but not sym- 
metric. 

3. The relation ^ on R is symmetric, but is neither reflexive nor transitive. 

4. The relation 0 C R x R is symmetric and transitive, but is not reflexive. 

5. The relation R on R defined by xRy if and only if y — x 2 has neither of 
the three properties. 

6. The relation = on R is reflexive, symmetric, and transitive. 



14 


2 Relations, functions, and Cartesian product 


A binary relation R C X x X is an equivalence relation on X if it 
is reflexive, symmetric, and transitive. Equivalence relations are usually 
denoted by symbols such as «, or =. 

Examples 1. The relation = on any set X is an equivalence relation. 

2. If L is the family of all straight lines on the plane and R is the relation 
on L of being parallel, then R is an equivalence relation. 

3. Let C be the family of all Cauchy sequences (a n ) = (ai ,a 2 ,...) of 
rational numbers. Define relation ~ on C by (a n ) ~ (b n ) if and only if 
lim„^ 00 (a n — b n ) = 0. Then ~ is an equivalence relation. 

4. A nonempty family X C V{X) is said to be an ideal on X if for every 
A, Be V(X) 

Ac B kB el-tdel and A,B Cl -> Al) B el. 

For every ideal X on a set V(X) the relation 

ArB if and only if AAB C X 
is an equivalence relation. 

Let X be a set. A family T of nonempty subsets of X is said to be a 
partition of X if (J T = X and sets belonging to T are pairwise disjoint, 
that is, for every A, B C T either A = B or AC B = /). 

For an equivalence relation E on X and x C X the set 

M ={yCX: xEy} 

is called the equivalence class of x (with respect to E). The family 

{[»] CV(X): x C X} 

of all equivalence classes for E is denoted by X/E and is called the quotient 
class of X with respect to E. 

For a partition T of a set X let us define a relation Rj- on X by 
xR^y <t=> 3 F C T (x C F k y C F ). 


Theorem 2.1.1 

(A) If E is an equivalence relation on X then the family X/E of all 
equivalence classes forms a partition of X, that is, for every x,y C X 
either [x] = [y] or [x] n [y] = 0. 

(B) If T is a partition of X then R^r is an equivalence relation on X. 
Moreover, X/R^r = T and Rx/e = E. 
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The proof is left as an exercise. 

Theorem 2.1.1 shows that there is a one-to-one correspondence between 
the class of all partitions of X and the family of all equivalence relations 
on X. In general, equivalence classes generalize the notion of equality. 
Elements within one equivalence class are identified by the relation. 

An element x of an equivalence class C is called a representative of C. 
A set of representatives of an equivalence relation E is a set that contains 
exactly one element in common with each equivalence class. Notice that, 
by Theorem 2.1.1 and the axiom of choice, to be stated shortly, a set of 
representatives exists for every equivalence relation. 

Axiom of choice For every family T of nonempty disjoint sets there ex- 
ists a selector, that is, a set S that intersects every F € T in precisely 
one point. 

The axiom of choice (usually abbreviated as AC) has the conditional 
existence character of the pairing, union, and power set axioms. However, 
it has also a very different character, since a selector, which exists by this 
axiom, does not have to be unique. This nonconstructive character of the 
axiom of choice was, in the past, the reason that some mathematicians 
(including Borel and Lebesgue) did not like to accept it. However, the 
discussion on the validity of the axiom of choice has been for the most part 
resolved today, in favor of accepting it. 

The axiom of choice will be one of the most important tools in this 
course. 


EXERCISES 

1 Let R C X xY and S C Y x Z be the relations. Prove that dom(i? -1 ) = 
range(-R), dom(S o R) C dom(R), and range(S of?) c range(S). 

2 Show the formulas (A. US') -1 = i? -1 US -1 , (f?nS) -1 = J? -1 nS -1 , and 
(, SoR ) -1 = R :i oS '. 

3 Prove the formulas 

(RUS)oT = (RoT)U(SoT), To (f?U5) = (Tof?) U (ToS), 
(RHS)oT C (RoT)n(SoT), To(RnS) C (ToR) n (ToS). 

Find examples of relations R, S, and T that show that the inclusions in 
the display cannot be replaced with equations. 

4 Find examples of relations on R. that are 

(a) reflexive, but neither symmetric nor transitive; 

(b) reflexive and symmetric but not transitive. 

5 Prove Theorem 2.1.1. 
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2.2 Functions and the replacement scheme 
axiom 

A relation R C X x Y is called a function if 

{'^x,y 1 ,y 2 )(xRy 1 & xRy 2 ->yi = y 2 ). 

Functions are usually denoted by the letters f,g,h,... .The domain dom(/) 
and the range range(/) of a function / are defined as for relations. For a 
function / if dom(/) = X and rang e(/) C Y then / is called a function 
(or map or transformation) from X into Y and it is denoted by f: X — > 
Y. If, moreover, range(/) = Y then / is said to be a function (map or 
transformation) from X onto Y, or a surjective function. The set of all 
functions from X into Y is denoted by Y x . 

If / € Y x and x £ X then there exists precisely one y £ Y such that 
xfy. The element y is called the value of / at x and is denoted by /( x). 
Thus, the formula y = f(x) has the same meaning as xfy. 

Notice that for /, g G Y x 

f = g o\/x £ X (f(x) = g(x)). 

A function / : X — > Y is a one-to-one (or injective) function if 

/(*) = f(y) x = y 

for all x, y £ X. A function f : X — > Y is a bijection if it is one-to-one and 
onto Y . 

For f:X—>Y,AcX, and B CY we define 

fl A \ = {/O) : X £ X} = {y £ Y : 3x £ X (y = f(x))} 


and 

f-\B) = {x£X: f(x)£B}. 

We use square brackets in f[A] rather than regular parentheses to avoid 
a double meaning for the symbol f(A) when A is at the same time an 
element of X and its subset. A similar double meaning may happen when 
the symbol f~ 1 (B) is used. However, in this case it will be always clear 
from the context which meaning of the symbol we have in mind. The sets 
f[A] and / _1 (H) are called the image of A and the preimage of B with 
respect to /, respectively. 
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Theorem 2.2.1 If f G Y x then f~ x is a function if and only if f is 
one-to-one. Moreover, if f~ x is a function then / _1 £ X z , where Z = 
range(/), and f~ x is one-to-one. 

Theorem 2.2.2 If f £ Y x and g £ Z x then g o / is also a function and 
g o f £ Z x . Moreover, (g o f)(x) = g(f(x)) for every x £ X. 

Theorem 2.2.3 Let f £ Y x and g £ Z x . If f and g are one-to-one, then 
so is g o f. If f and g are onto, then so is g o /. 

The proofs are left as exercises. 

For the proof of the next theorem we need one more axiom scheme. 

Replacement scheme axiom (Fraenkel 1922; Skolem 1922) For every 
formula <p(s, t, U, w) with free variables s, t, U , and w, set A, and 
parameter p if ip{s,t, A,p) defines a function F on A by F(x) = 
y <t=> (p(x, y, A,p), then there exists a set Y containing the range F[A] 
of the function F, where F[A\ = {F(x) : x £ A}. 

As with the comprehension scheme axiom, the replacement scheme 
axiom is in fact a scheme for infinitely many axioms, one for each for- 
mula ip. In conjunction with the comprehension scheme axiom, the re- 
placement scheme axiom implies that for a function defined by formula 
F(x) = y <t=> ip(x,y,A,p) on a set A, the range of F exists, since it can be 
defined: 

F[A\ = {y £ Y : 3x £ A (y = F(x))}. 

Sometimes the replacement scheme axiom is formulated in a stronger ver- 
sion, which states that the set Y existing by the axiom is equal to the range 
of F. It is worth noticing that such a stronger version of the replacement 
scheme axiom implies the comprehension scheme axiom. 

Now we are ready for the next theorem. 

Theorem 2.2.4 If F is a family of nonempty sets then there is a function 
f: F — > (J F such that f(X) £ X for every X £ F . 

Proof Take a formula tp(X, Y) that can be abbreviated to 


Y = {X} x X. 

Notice that if ip(X,Y) and <p(X,Y') hold then Y = {X} x X = Y'. In 
particular, ip satisfies the assumptions of the replacement scheme axiom. 
So there exists a family 
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G = {Y : 3X G T (p(A, Y))} = {{X} x X : X £ T}. 

Notice that elements of Q are nonempty and that they are pairwise disjoint, 
since 

({A'} x X) n ({A'} x X') = ({A} n {A'}) x (A n A'). 

Thus, by the axiom of choice, there is a selector / for Q. Notice that f is 
the desired function. □ 

The function from Theorem 2.2.4 is called a choice function for the 
family T . In fact, the statement that for every family of nonempty sets 
there exists a choice function is equivalent to the axiom of choice. The 
statement follows from the axiom of choice by Theorem 2.2.4. To see that 
the axiom of choice follows from it define a selector S for a family T of 
nonempty pairwise-disjoint sets as S = rang e(/) when / is a choice function 
for T . 

For a function / G Y x and A C A the relation /fl (A x Y) is a function 
that belongs to Y A . It is called the restriction of / to the set A and is 
denoted by f\ a- 

A function g is said to be an extension of a function / if / C g. We 
also say that / is a restriction of g. 

Theorem 2.2.5 If f and g are functions then g extends f if and only if 
dom(/) C dom(g) and f(x) = g{x) for every x G dom(/). 

The proof is left as an exercise. 

Functions for which the domain is the set N of natural numbers are 
called (infinite) sequences. A (finite) sequence is any function whose do- 
main is any natural number n (= {0, 1, . . . , n — 1}). If a is any sequence, 
then we usually write a m in place of a(m). We will sometimes represent 
such a sequence by the symbol (a m ) or {a m }, indicating the domain of the 
sequence if necessary. 


EXERCISES 


1 Prove Theorem 2.2.1. 

2 Prove Theorem 2.2.2. 

3 Prove Theorem 2.2.3. 


4 Prove Theorem 2.2.5. 
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2.3 Generalized union, intersection, and Car- 
tesian product 

Let F be a function from a nonempty set T into a family of sets Q. We will 
usually write F t instead of F(t) and {F t : t € T} or {F t } t& T for range (Fj. 
The family F = {F t }teT is called an indexed family , with T being the index 
set. 

The following notation will be used for F = 

U^ = U^ 0* = !>- 

teT teT 

In the text these sets will also appear as U tgT fo and flter Z respectively. 

When the set T is fixed, we will sometimes abbreviate this notation and 
write {F t } t or {F t } in place of {F t } teT , and [J t F t and f| t F t for UteT F t and 
n tGT Ft. When the index set of the index family F is a Cartesian product 
S x T then we usually denote its elements by F st instead of F^^ and we 
say that F is a doubly indexed family. We denote it as {F st : s £ S, t £ T} 
or simply {F st }. In such a case we will write Uses teT Fst or U s t F st for 

U ( s ,t)eSxT F s t and PlseS, teT d or f~l s ,t Gsi for D ( s ,t)eSxT F s t- If S = T 

we will also write (J, ter Fst and P| s teT F st . 

The following properties are easy to verify: 


U Ft 3t £ T (x £ Ft) 

and x G p| F t \/t G T (x G F t ) 

teT 

teT 

ci 

ii 

T3 

ii 

Z) 

o’ 

•-S 

II 

teT 

teT 

[J F t = F p U F q and 
teT 

Pl Ft = Fp n F q for T = {p. q}. 

teT 


It is also easy to see that for any formula ip(t, x) and sets T, X 
G X : (p(t, x)} = {x G X : 3t G T x))} 

teT 

o € X: ip(t,x)} = {x £ X : \/t £ T ( ip(t,x ))}. 
teT 

Other properties of these operations are listed in the exercises. 

The behavior of generalized union and intersection under the action of 
image and preimage of a function is described in the next theorem. 
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Theorem 2.3.1 If f £ Y x , {F t }t^T is an indexed family of subsets of X, 
{Gt}teT is an indexed family of subsets of Y , A, B C X , and C, D C Y 
then 

( a ) / [Uter Ft] = Uter f[Ft]i 

(b) / [Pi teT Ft] C f] teT f[F t \; 

(c) f[A]\f[B]cf[A\B]; 

(d) r 1 {Uter Gt)= UteT r\Gt); 

(e) r i (a e TG t ) = a e T/- 1 (G t ); 

(f) f-\A)\f-\B) = f-\A\B). 

The proof is left as an exercise. 

For an indexed family {F t : t £ T} we define its Cartesian product by 
l[ F t = {h£ Z T :Wt£T ( h{t ) £ F t )}, 

teT 

where Z = IJjgt Ft- If all the sets Ft are identical, F t = Y, then IlfeT Ft = 
Y t . The set Y T is called a Cartesian power of the set Y. 

For t £ T the function p t : Ft — ► F t defined by p t (x) = x(t ) is 

called the projection of IlteT Ft onto F t . 

Remark For T a two-element set {a, b} the Cartesian products liter Ft 
and F a x Fi, are different. (The first one is a set of functions on T, the second 
one is a set of ordered pairs.) However, there is a natural identification of 
every element {(a, x), ( b , y)} from liter Ft with (x, y) € F a x Fi,. Therefore 
we will usually identify these products. 


Theorem 2.3.2 A product liter Ft of nonempty sets F t is nonempty. 

Proof The choice function / for the family { F t : t £ T}, which exists by 
Theorem 2.2.4, is an element of riter^t- ^ 

Remark Theorem 2.3.2 easily implies Theorem 2.2.4. Thus its statement 
is equivalent to the axiom of choice. However, we do not need the axiom 
of choice to prove Theorem 2.3.2 if either T is finite or Uter Ft = Y T . 
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EXERCISES 

1 Prove that for every indexed families {F t }teT and {G t }teT 

( a ) fl teT^t H Gt) = flteT Ft H flteT Ft, 

(b) {J teT (F t U Gt) = UteT Ft U UteT Ft, 

( c ) PlteT Ft U PlteT ^t = Ds.teT^s '-I Gt) C H \ teT (Ft U Gt), and 

(d) UteT Ft n UteT Ft = U s ,ter(-^s ^ Ft) 3 Uter(-^t bl Gt). 

Give examples showing that the inclusions cannot be replaced by equalities. 

2 Show that for every indexed family {F t }t^T and every set A 

(a) A \ HtgT Ft = U ter(A \ F t ) and A \ UteT Ft = fl teri-A \ Ft), 

(b) flteT^ U Ft) = A U PlteT Ft and UteT^ ^ Ft) = A fl UteT Ft- 

3 Prove that for every indexed families {F t }t^T and {G t }teT 

( a ) (UteT-^t) x (UteT Ft) = Us.teT^s x Ft), 

(b) (Dter Ft) x (DteT^*) = fls.teT^s x G t ). 

4 Show that Uses flteT F s t C Uter Use s F st for every doubly indexed 
family { F st : s £ S, t £ T}. 

5 Prove Theorem 2.3.1(a), (b), and (c). Show, by giving examples, that 
the inclusions in parts (b) and (c) cannot be replaced by equality. 

6 Prove Theorem 2.3.1(d), (e), and (f). 

7 For r, s £ R let A r = [r, r + 1] and B rs = [r, s). Calculate U s <o D r >« 
Ds<0 Ur>s Ar, Ur<0 fls>r F rs , and flr<0 U s >r F rs - 

2.4 Partial- and linear-order relations 

A binary relation R on X is antisymmetric if 

xRy & yRx — > x = y 

for every x, y £ X. 

A relation R on A is a (partial-) order relation if it is reflexive, transitive, 
and antisymmetric. Order relations are usually denoted by the symbols <, 
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A, or <C. If < is a partial-order relation then the ordered pair ( X , <) is 
called a partially ordered set (abbreviated also as poset). 

Examples 1. The relations < and > on R are order relations. 

2. For any set X the relation C is an order relation on V(X). 

3. The relation | on the set {2, 3, 4, ... } defined by 

n\m if and only if n divides m 


is an order relation. 

An element m £ X of an ordered set (X, <} is minimal if for every 
x € X the condition x < m implies x = m. Similarly, an element M £ X 
is maximal if for every x € X the condition x > M implies x = M. 

An element m £ X is the smallest element ( least element or first ele- 
ment) in A if m < x for every x £ X, and M £ X is the greatest element 
( largest element or last element ) in X if x < M for every x £ X. 

Theorem 2.4.1 Let (X,<) be a partially ordered set. 

(a) X can have at most one greatest and one smallest element. 

(b) The smallest element of X, if it exists, is the only minimal element 
ofX. 

(c) The greatest element of X, if it exists, is the only maximal element 
ofX. 

Proof (a) If a and b are the smallest elements of X then a < b and b < a. 
Hence a = b. The argument for the greatest element is the same. 

(b) If a is the smallest element of X then it is minimal, since the condi- 
tion x < a combined with a < x, which is true for every x, implies x = a. 
Moreover, if m is minimal, then a < m, since a is the smallest element, 
and, by minimality of to, to = a. 

(c) The argument is the same as in (b). □ 

Examples 1. (R, <) has neither minimal nor maximal elements. 

2. ([0, 1], <) has 0 as the least element and 1 as the last element. 

3. 0 is the smallest element of (V(X),c). X is the greatest element of 
(V(X),C). 

4. Let ({2, 3, 4, . . . }, |) be defined as before. It does not have any maximal 
element. A number to is minimal in this order if and only if to is a prime 
number. (Thus ({2, 3, 4, . . . }, |) has infinitely many minimal elements!) 
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A relation R on A is connected if 

xRy V yRx 

for every x,y £ X. An order relation < is called a linear-order relation if 
it is connected. In this case we also say that ( X , <) (or just X) is linearly 
ordered. 

Examples 1. (R, <) is linearly ordered. 

2. (V(X), c) is linearly ordered if and only if A' has at most one element. 

3. ({2, 3, 4, . . . }, |) is not linearly ordered, since neither 2|3 nor 3|2. 

4. The relation < on N is a linear-order relation. 


Theorem 2.4.2 If (A, <) is linearly ordered then every minimal element 
in X is the smallest element and every maximal element in X is the greatest 
element. In particular, linearly ordered sets can have at most one maximal 
element and at most one minimal element. 

The proof is left as an exercise. 

Notice that if < is an order relation on a set X and Y C X then the 
relation < fl(Y x Y) is an order relation on Y. It is called the restriction 
of < to Y. We often write (Y, <) in place of (Y, < fl(Y x Y)}. 

Notice also that a subset of a partially ordered set is partially ordered 
and a subset of a linearly ordered set is linearly ordered. 

In general, for any partial order denoted by < we will write > for (<)^ x 
and define relations < and > by 

x<y<=>x<y$zx^y 


and 


x>y^x>ySzx^4y. 

EXERCISES 


1 Prove that the restriction < fl(Y x Y) of an order relation < on X is 
an order relation, provided Y C X. Show that (Y, <) is linearly ordered 
if (A, <) is linearly ordered. Give an example such that (Y, <) is linearly 
ordered, while (A, <} is not. 

2 Find all minimal, maximal, greatest, and smallest elements of (A, c), 
where T = {A C N: A is finite}. 


3 Prove Theorem 2.4.2. 
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4 A binary relation R C X x X is said to be a preorder relation if it is 
transitive and reflexive. 

Let Al x A' be a preorder relation. 

(a) Show that the relation ~ on X defined by 

x~y<=>x^ylky^x 
is an equivalence relation on X. 

(b) Define the relation < on the family X/ ~ of all equivalence classes for 
~ by 

[x] < [y] x A y. 

Show that the relation < is well defined and that it is a partial-order 
relation. 



Chapter 3 


Natural numbers, 
integers, and real numbers 

From the results of Section 1.2 it is clear that sets such as 0 , { 0 }, { 0 , { 0 }}, 
{{ 0 }}, and so forth exist. Using the axiom of infinity we can also conclude 
that we can build similar infinite sets. But how do we construct complicated 
sets, such as the sets of natural and real numbers, if the only tools we have 
to build them are the empty set 0 and “braces” {•}? We will solve this 
problem by identifying the aforementioned objects with some specific sets. 

Briefly, we will identify the number 0 with the empty set 0, and the 
other natural numbers will be constructed by induction, identifying n with 
{0,... , n — 1}. The existence of the set N of all natural numbers is guar- 
anteed by the infinity axiom. The real numbers from the interval [0, 1] will 
be identified with the set of functions {0, 1} N , where an infinite sequence 
a: N — > {0, 1} is identified with the binary expansion of a number, that is, 
with a(n)/2 n+1 . The details of these constructions are described in 

the rest of this chapter. 

3.1 Natural numbers 

In this section we will find a set that represents the set N of natural numbers 
in our set-theoretic universe. For this, we need to find for each natural 
number n a set that represents it. Moreover, we will have to show that the 
class of all such defined natural numbers forms a set. 

When picking the natural numbers, we will have to pick also the order- 
ing relation < between them. Essentially, the only relation that we have 
available for this purpose is the relation G. Thus, we will choose the natural 
numbers to satisfy the following principle: 
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in < n if and only if m £ n. 

Also, since natural numbers are going to be distinguished by the relation £ 
of being an element, it seems to be natural to have the following intuitive 
principle: 


Each natural number n should have n elements. 

These two principles give us no choice about our definition. 

By the second principle we have 0 = 0. 

Now suppose that we have already defined n and want to define n + 1. 
Since n<n+lwe have n £ n + 1. Also, for every m £ n we have m < n, 
som<n + l and m £ n+1. In particular, nCn+1 and n U {?r} C n+1. 
But n has n elements, so n U {n} has n + 1 elements, since n ^ n, as 
n -ft n. Therefore nU {?r} C n + 1 and both sets have n+1 elements. Thus, 
n + 1 = n U {?i}. 

By the foregoing discussion we see that 0 = 0, 1 = {0} = {0}, 2 = 
{0, 1} = {0, {0}}, 3 = {0, 1, 2} = {0, {0}, {0, {0}}}, and so forth. 

Now, showing that the class of all such numbers forms a set is another 
problem. We will use for this the axiom of infinity. We will also make 
sure that the following three principles are satisfied, where S(n) = n + 1 
is known as a successor of n and P stands for some property of natural 
numbers. 

PI 0 S(ri) for every n £ N. 

P2 If S(n) = S(m) then n = m for every n,m el 

P3 If 0 has property P and for every n £ N 

S(n) has property P provided n has property P 
then n has property P for every n £ N. 

Principles P1-P3, known as the Peano axioms of arithmetic, are the 
most commonly accepted axioms for the natural numbers. Axiom P3 is 
the principle of mathematical induction. 

Notice that the definition of the successor operator S(x ) = xL){x} from 
the axiom of infinity coincides with the definition given in this section, since 
S(n) = n + 1 = n U {n}. 

Now we are ready to construct the set N. 
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Theorem 3.1.1 There exists exactly one set N such that 

(1) 0 G N, 

(2) x £ N — > S(x) £ N for every x, and 

(3) if K is any set that satisfies (1) and (2) then N C K. 

Proof By the axiom of infinity there exists at least one set X satisfying 
(1) and (2). Let 

T = {Y £ V{X) : 0 G Y & \/x £ Y (S(x) £ F)} 

and put N = f) T . It is easy to see that the intersection of any nonempty 
family of sets satisfying (1) and (2) still satisfies (1) and (2). To see that 
(3) holds let K satisfy (1) and (2). Then InA' £ T and N = C 
X D K C K. Theorem 3.1.1 is proved. □ 

The set N from Theorem 3.1.1 will be called the set of natural numbers. 
Natural numbers will usually be denoted by the letters m,n,p, . . . . The 
operation S(n) is the counterpart to (n+ 1). Thus for a natural number n 
we will henceforth write n + 1 instead of S(n). 

The set of natural numbers will also be denoted by the symbol ui. 
Notice that condition (3) of Theorem 3.1.1 can be rephrased as follows: 

[0 £ K & \/x (x £ K — > S(x) £ K )] implies N C K. (3.1) 

If we additionally know that K C N then (3.1) can be expressed as 

[0 £ K & \/n (n £ K — > n + 1 £ K)} implies N = K. (3.2) 

Property (3.2) is called the induction property of the set of natural numbers 
and the proofs that use it are cited as proofs by induction. 

It is easy to see that the induction property (3.2) implies axiom P3 if 
we put K = {n £ N: n has property P}. Axiom P2 follows from The- 
orem 3.1.2(f). To see that N satisfies PI just notice that 0 = 0 and 
S(n) = n U {n} is clearly nonempty. 

Essentially all the known properties of the natural numbers can be 
deduced from these definitions and the ZFC axioms. 1 However, we will 
restrict ourselves only to those properties that are more connected with 
our representation of the natural numbers and to those that will be of 
more use in what follows. 

1 In fact, the Peano axioms are strictly weaker than the ZFC axioms. More precisely, 
everything that can be proved in the Peano arithmetic can be proved in ZFC. However, 
there are properties of natural numbers that can be deduced from the ZFC axioms but 
cannot be proved on the basis of the Peano axioms alone. 
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Theorem 3.1.2 For every natural numbers m, n, and p 

(a) to C to + 1 and m £ to + 1; 

(b) if x £ m then x £ N; 

(c) if m £ n then in + 1 C n; 

(d) if m £ n and n £ p then in € p; 

(e) to ^ to; 

(f) if n + 1 = to + 1 then n = to; 

(g) to C n if and only if m = n or to £ n; 

(h) to C n or n C to. 

Proof (a) This follows immediately from the definition m+ 1 = toU {to}. 

(b) The proof follows by induction. Let 

K = {n £ N: Va: (a: £ n — » x £ N)}. 

Notice that 0 = 0 £ K, since there is no x £ 0. Now let n £ K and 
x £ n + 1. Since n + 1 = n U {?i} we have two cases: If x £ n then xtN, 
since n £ K; if x £ {n} then x = n and also x £ N. Thus n £ K implies 
n + 1 £ K. Hence, by (3.2), K = N and (b) is proved. 

(c) The proof is left as an exercise. 

(d) Since m £ n and, by (a), nCn + lwe have in £ n + 1. But, by (c), 
n £ p implies n + 1 C p. So to £ p. 

(e) This can be easily proved by induction. It is left as an exercise. 

(f) This is left as an exercise. 

(g) <=: If to = n then to C n. If to G n then, by (a) and (c), to C 
to + 1 C n. 

=>: Fix to £ N and consider the set 

K = {n £ N: m C n — * [to £ n V to = n]}. 

It is enough to show that K = N. 

First notice that 0 £ K since to C 0 = 0 implies to = 0 = 0. 

Now assume that n £ K. By induction it is enough to show that 
n + 1 £ K. So assume that ?nCn+l = nU {?r}. If to n then n £ m 
and, by (c), n + 1 C to C n + 1, that is, to = n + 1. If to C n then, since 
n £ K, either m £ n or m = n. But in both cases to £ n U {n} = n + 1; so 
n+1 £ K. 

(h) Define the set 

K = {to £ N: Vn £ N (n £ to — > m C n)}. 



3.1 Natural numbers 


29 


First we will show that K = N. 

First notice that 0 £ K , since 0 = 0 C n for every n. 

Now assume that rn £ K. We will show that this implies m + 1 £ K. 
So let n € N be such that n^m+l = mU {to}. Then n ^ m and n $ m. 
But to £ K and n m imply that to C n. This last condition and n ^ in 
imply in turn, by (g), that m £ n and, by (c), m+1 C n. Thus we conclude 
that n £ K implies n + l £ K and, by induction, that K = N. 

Now, to finish the proof, let n,m £ N. If n C m then the conclusion is 
correct. So assume that n <f_ to. Then, by (g ), n £ to. Since m £ N = K 
we conclude to C n. □ 

From Theorem 3.1.2 it follows immediately that every natural number 
n = {0, 1,2,... , n — 1}. However, we cannot express it in the formal lan- 
guage of set theory, since the symbol does not belong to its language. 

We can only express it using infinitely many sentences, one for each natural 
number. On the other hand, this informal expression is more intuitive for 
most mathematicians than the formal definition. We will often use it to 
describe the set of natural numbers less than a fixed number. In particular, 
we will very often write 2 instead of {0, 1}. 

Let us also define for natural numbers ?n and n 

m < n if and only if m £ n 


and 

to < n if and only if m C n. 

It is not difficult to see that these definitions imply the properties of < and 
< that we usually associate with them. 

We will finish this section by defining on the set N the arithmetic opera- 
tions of sum, product, and exponentiation. The definitions are by induction 
and go as follows: 

n + 0 = n; n + (to. + 1) = (n + to) + 1, where k + 1 stands for S(k). 

n 0 = 0; n (rn + 1) = (n to) + n. 
n° = 1; n m+1 = n m n. 

It can be proved, by induction, that these operations satisfy all known 
properties of the standard operations. 

EXERCISES 

1 Prove that if T is a nonempty family such that every F £ T satisfies (1) 
and (2) from Theorem 3.1.1 then f) T also satisfies these conditions. 

2 Prove by induction Theorem 3.1.2(c), (e), and (f). 
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3.2 Integers and rational numbers 

Intuitively, we will define integers as differences n — m of two natural num- 
bers. The formalization of this intuition has, however, two problems. First, 
we do not have an operation — defined as of yet. Thus we will talk about or- 
dered pairs (n, m) of natural numbers, thinking of them as n — m. However, 
representing the number n — m. as (n, to) is not unique: 1 = 2 — 1 = 3 — 2. 
So, instead, we will use equivalence classes of pairs of natural numbers, 
where the equivalence relation E on N x N is defined by the formula: 

(n, m)E(n', to/) <t=> n + m! = v! + to. (3-3) 

Now the set of integers Z is defined as the quotient class (N x N)/ E. 

We define a relation < on Z by 

[(n, to)] < [{n' , m')] <=> n+m! < n' + to. (3.4) 

The arithmetic operations on Z are defined as follows: 

[(n, to)] + [(n 7 , m')\ = [(n + n' , to + to 7 )]. 

[(n,?n)] - [{n' ,m')} = [(n, to)] + \{m' ,n')\. 

[(n, to)] [(n 7 , to 7 )] = [(n n' + to to 7 , n to 7 + to n 7 )] . 

We will leave as an exercise the proof that E is an equivalence relation, 
that the preceding operations are well defined (i.e. , they do not depend on 
the choice of representatives of equivalence classes), and that they have the 
properties we know from algebra. 

Following the same path, we define the set Q of rational numbers as 
ordered pairs (a, b) of integers representing of a/b. More precisely, we 
define an equivalence relation E' on Z x Z by 

(a,b)E r (a 1 ,b') <t=> [a b' = a' b & b ^ 0 ^ b'\ or \b = b' = 0]. 

Then 

Q = {[<a,6)]: a,&GZ& b^O}, 

where [(a, b )] is an equivalence class with respect to E' . We define a relation 
< and arithmetic operations on Z as follows: 

[(a,b)] + [(a',b')] = [(ab' + a / b,bb')}-, 

[(a, b)} [(a l ,b')] = [(aa',bb% 

[(a, b )] < [(a 7 , b 7 )] <t=> b > 0 & b' > 0 & ab' < a' b. 

As before, we will leave as an exercise the proof that E' is an equivalence 
relation, that the preceding definitions are well defined, and that they have 
the desired properties. 
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EXERCISES 

1 Prove that the relation E defined by (3.3) is an equivalence relation. 

2 Show that < on Z defined by (3.4) is well defined and is a linear-order 
relation. 


3.3 Real numbers 

In the previous section we constructed integers from the natural numbers 
and rational numbers from the integers. Both constructions were primarily 
algebraic. This was possible since the rational numbers are the algebraic 
completion of the set of natural numbers. The same approach, however, 
cannot be used in a construction of the real numbers, since there are real 
numbers of no simple algebraic relation to rational numbers. 

The crucial geometric property that is often used to described real num- 
bers is that the real line “does not have holes.” This intuition was used by 
Dedekind to define the set R of real numbers as follows. 

A subset A of Q is said to be a Dedekind cut if A does not contain a 
largest element and is a proper initial segment of Q, that is, 0 ^ A ^ Q 
and A contains every p < q provided q € A. Now, R can be defined as 
the set of all Dedekind cuts, intuitively identifying A with a real number 
sup A. Then for Dedekind cuts A and B we can define 

A + B = {p + q \ p £ A &, q £ B} 


and 

A < B <^> A C B. 

Similarly, we can define the product A B of two numbers (though the def- 
inition must be done more carefully, since the product of two negative 
numbers is positive). 

Another approach for the definition of real numbers is to use their 
numerical expansions. This approach is a little more messy, but it better 
fits the methods of this course. So we will include it here too. 

The construction of real numbers that follows will be done in two steps. 
First, consider the set 2 N of all sequences s: N — > {0, 1}. It is usually 
called the Cantor set. (The classical Cantor “ternary” set can be obtained 
from this set by identifying a sequence a € 2 N with N 2a„/3 ra+1 . See 
Section 6.2 for more details.) We would like to identify a sequence a £ 2 N 
with the real number a n/2 n+1 . However, this identification function 

is not one-to-one. To correct it, we will define [0, 1] as the quotient class 
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2 N /E, where aEb if and only if a = b or there exists an n £ N such that 
for every k £ N 

[k < n—>ak = bk\ & [a n = 1 & b n = 0] & [k > n^>{ak = OSzbk = 1)]. (3.5) 

We will leave as an exercise the proof that E is indeed an equivalence 
relation. 

The linear-order relation < on [0, 1] can be defined by 

[a] < [6] <t=> [a] = [6] V 3n £ N [a n < b n & Vfc € n (a* = &&)]. (3.6) 

Again, we will not prove the correctness of this definition. 

Numbers 0 and 1 in [0, 1] are defined as equivalence classes of functions 
constantly equal to 0 and 1, respectively. Then we define 

[ 0 , 1 ) = [ 0 , 1 ] \ { 1 }, ( 0 , 1 ] = [ 0 , 1 ] \ { 0 }, ( 0 , 1 ) = [ 0 , 1 ] \ { 0 , 1 }. 

The set R of real numbers is defined as Z x [0, 1), where intuitively we 
identify a pair (fc, r) with k + r. In particular, we can define < on R by 

(k,r) < (l, s) <=> k < l V (k = l & r < s). (3-7) 

In the remainder of this section we will recall some geometric and topo- 
logical properties of the n-dimensional Euclidean space R" (n £ N). In 
particular, the distance between two points p = (pi , . . . ,p n ) and q = 
(qi, . . . ,q n ) of R” is given by the formula 


d{p, q) 


n 


N 


^iPi-Vif- 


The open ball in R” centered at p £ R" and with radius e > 0 is defined as 


B(p,e) = {q £ R”: d(p,q) < e}. 


A subset U of R" is open if U is a union of some family of open balls in 
R". The family r of all open subsets of R" is called the (natural) topology 
on R ra and is closed under finite intersections and arbitrary unions. 

A set F C R" is closed if its complement R” \ F is open. Notice that 
finite unions of closed sets and arbitrary intersections of closed sets are also 
closed. 

For a subset S of R" its interior int(S') is defined as the largest open 
subset of S, that is, 

int(S') = CR": (/ is open in R”}. 
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The closure of S is the smallest closed set containing S, that is, 

d(S) = p| {F d S: F is closed in R n }. 

A subset D of R” is dense in R n if cl(£>) = R” or, equivalently, when 
D fl U 7 ^ 0 for every nonempty open set U C R n . In particular, Q n is a 
dense subset of R". A subset N of R" is nowhere dense if int(cl(lV)) = 0. 

Now let us recall a few more specific properties of R". A function 
/: R" — > R m is continuous if f~ 1 (U) = {x G R” : /( x) G U} is open in 
R™ for every open set U C R m . It is easy to see that for every dense 
subset D of R” and continuous functions / : R" — > R m and g : R" — > R m if 
f(d) = g(d) for every d € D then / = g. 

A subset B of R" is bounded if B C B{p,e) for some p £ R™ and 
e > 0. Closed, bounded subsets of R” are called compact. One of the most 
important properties of compact subsets of R" is given in the next theorem. 

Theorem 3.3.1 If Kq D K\ D K 2 D • • • is a decreasing sequence of com- 
pact nonempty subsets of R” then their intersection Hto nonempty. 

A subset C of R” is connected if there do not exist two disjoint open 
sets U, V C R" such that (7DC/0, b fl C ^ 0, and C C U U V. Recall 
that any interval in R as well as any R" are connected. 

A sequence {pk}kLo of points in R” is a Cauchy sequence if for every 
£ > 0 there is a number N such that d(pi,pj) < £ for every i,j > N. Every 
Cauchy sequence in R" has a limit point. This fact serves as a basis for 
the proof of the Baire category theorem, which follows. 

Theorem 3.3.2 (Baire category theorem) If N 0 , Ni,N 2 , ... is a sequence 
of nowhere- dense subsets of R" then its union (Jtlo has an empty inte- 
rior. 


EXERCISES 

1 Define the product of two real numbers using Dedekind’s definition of 
R. Show that your product has the distributive property, that is, that 
a(b + c) = ab + ac for every a, b, c G R. 

2 Prove that the relation E defined by (3.5) is an equivalence relation. 

3 Show that < on [0, 1] defined by (3.6) is well defined and is a linear-order 
relation. 

4 Prove that < on R defined by (3.7) is a linear-order relation. 
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Chapter 4 


Well orderings and 
transfinite induction 

4.1 Well-ordered sets and the axiom of foun- 
dation 

A binary relation R on a set X is said to be well founded if every nonempty 
subset Y of X has an f?-minimal element, that is, if 

VY C X [Y ^ 0 — > 3m € Y ~^3y g Y ( yRm)\ . 

Examples 1. For every finite linearly ordered set (X, <) the relation < is 
well founded. 

2. The relation < on the set N of all natural numbers is well founded. This 
is known as the well-ordering principle for the natural numbers. It can be 
easily deduced from the principle of mathematical induction P3. (See also 
(3.2).) It is also a special case of the next example. 

3. The relation g is well founded on every nonempty set X. This, however, 
does not follow from the axioms we have studied so far. For this we need 
the following axiom known as the axiom of foundation or regidarity. This 
is the last of the axioms of ZFC. 

Foundation axiom (Skolem 1922; von Neumann 1925) Every nonempty 
set has an g-minimal element: 

\/x [3 y(y g &)— >3 y[y g x & Sz(z gifezg y)]] . 

This axiom is not essential for this course. However, it is good to know 
some of its basic consequences. 
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Theorem 4.1.1 x fL x for every x. 

Proof To obtain a contradiction assume that there exists a t such that 
t £ t. Put x = {f}. We will see that the axiom of foundation fails for x. 

Evidently, there exists a y £ x, since t € x. Moreover, if y £ x then 
y = t. But then, there exists a z, z = t, such that z = t £ x and 
z = t £ t = y, contradicting the axiom of foundation. □ 

Theorem 4.1.2 There is no infinite £-decreasing sequence, that is, there 
is no sequence (x n : n £ N) such that x n +i £ x n for all n £ N. 

Proof Otherwise the set {x n : n £ N} would have no ^-minimal element. 

□ 


A binary relation < on X is a well-ordering relation if ( X , <) is linearly 
ordered and the relation < is well founded. In other words, a linearly 
ordered set (X, <) is well ordered if every nonempty subset A of A has a 
smallest element a £ A, usually denoted by min A. 

It is pretty easy to see that a linearly ordered set (X, <) is well ordered 
if and only if it does not contain an infinite strictly decreasing sequence, 
that is, a sequence (x n £ X : n £ N) such that x n +\ < x n for every 
n £ N. However, a formal proof of this fact requires a recursive definition 
technique, which is still not available to us. Thus its proof will be postponed 
until Section 4.3 (see Theorem 4.3.2). 

Notice also that a subset of a well-ordered set is well ordered. 

Examples 1. Every finite linearly ordered set is well ordered. 

2. The set N of all natural numbers is well ordered by the usual order <. 

3. The set S = {1 — 1/n: n £ N, n > 0} U {1} is a well-ordered subset of 
(R, <) since S' 0 = {1 — 1/n: n € N, n>0}is ordered the same way as N 
and 1 is greater than any number from S'o- 

4. The interval [0, 1] with the usual order is linearly ordered and has the 
smallest element. However, it is not well ordered, since its subset (0, 1) 
does not have a smallest element (0 is not an element of (0, 1)). 

5. Any family of sets linearly ordered by the relation 

xey <t=> x £ y V x = y 

is well ordered by e. This follows from the axiom of foundation. 

The partially ordered sets (X, <) and (Y, A) are (order) isomorphic (or 
have the same order type) if there is a bijection / : X — > Y such that 


a < b if and only if f(a) A f(b) 



4-1 Well-ordered sets and the axiom of foundation 


39 


for every a,b £ X. The function / is called an order isomorphism (or just 
isomorphism) between X and Y. 

The next two propositions list some basic properties of order isomor- 
phisms. Their proofs are left as exercises. 

Proposition 4.1.3 Let (X,<), (F, A), and (Z, <C) be partially ordered 
sets. 

(1) If f: X — > Y is an order isomorphism between X and Y then the 
inverse function f : Y — > X is an order isomorphism between Y 
and X. 

(2) If f: X — > Y is an order isomorphism between X and Y and g: Y — > 
Z is an order isomorphism between Y and Z then g o / : X — > Z is 
an order isomorphism between X and Z . 

Proposition 4.1.4 Let f : X — > Y be an order isomorphism between par- 
tially ordered sets (X,<) and (Y,X). Then 

(1) X is linearly ordered (well ordered) if and only ifY is linearly ordered 
(well ordered); 

(2) x £ X is the smallest (the greatest, a minimal, a maximal) element 
of X if and only if f(x) is the smallest (the greatest, a minimal, a 
maximal) element ofY. 

Examples 1. The intervals [0, 1] and (0, 1) are not order isomorphic since 
[0, 1] has a smallest element and (0, 1) does not have one. The set ('P(R), c) 
is isomorphic to neither of these sets, since it is not linearly ordered. 

2. The sets N of natural numbers and P of even natural numbers have the 
same order type. The isomorphism is established by the function /(n) = 
2 n. The order type of this set is denoted by u> and represents an infinite 
strictly increasing sequence. (In Section 4.2 we will make the notion of 
“order type” of well-ordered sets more formal.) 

3. The order type of the set S = {1 — 1/n: n £ N, n > 0} U {1} is denoted 
by to + 1 and it is not the same as to, since S has a greatest element and N 
does not. 

4. The set {— n: n £ N} ordered by < is isomorphic to (N, (<) _1 ) = (N, >) 
by the isomorphism f{n) = —n. The order type of this set is denoted by 
to* . The sets with order type u>* are infinite strictly decreasing sequences. 

A subset S' of a partially ordered set (X, <) is an initial segment of X 
if for every x, y £ X 

x<ySzy£S^x£S. 
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An initial segment S of X is proper if S ^ X. It is easy to see that if 
/: X — > Y is an order isomorphism between ( X , <) and ( Y , A) then S C X 
is a (proper) initial segment of X if and only if f[S] is a (proper) initial 
segment of Y . 

Note that for every element xq of a partially ordered set (X, <) the set 
0(x o) = {x € X : x < £ 0 } 

is a proper initial segment of X. However, not every proper initial segment 
of a partially (even linearly) ordered set is of the form O(:ro), as is shown 
by the subset (— 00 , 0] = {x £ R: x < 0} of R. On the other hand, such an 
example cannot be found in a well-ordered set, as is asserted by the next 
theorem. 

Theorem 4.1.5 Every proper initial segment S of a well-ordered set W is 
of the form 0(£) for some £ £ W. 

Proof Since S' is a proper subset of W the set W \ S is nonempty. Let 
f = min {W \ S). Then S = 0 (£) . ' □ 


Theorem 4.1.6 (Principle of transfinite induction) If a set A is well- 
ordered, B C A, and for every x £ A the set B satisfies the condition 

O(x) CB => xeB, (4.1) 


then B = A. 


Proof It is enough to show that A \ B = 0. To arrive at a contradiction 
assume that A \ B ^ 0 and let x — min (A\B). Then, by minimality of x, 
O(x) C B. Hence, by (4.1), x £ B. This contradicts x £ A\B. □ 

If we apply Theorem 4.1.6 to A = N then we obtain the following version 
of the induction schema for the set of natural numbers. 

Corollary 4.1.7 If B C N and for every n £ N the set B satisfies the 
condition 


n C B => n £ B, 


(4.2) 


then B = N. 


The statement of Corollary 4.1.7 also follows immediately from the in- 
duction property (3.2). The converse implication is also easy to prove and 
can be concluded directly, without the use of Theorem 4.1.6. 
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Proofs using Theorem 4.1.6 are called proofs by transfinite induction. 
Such proofs are the basic techniques of set theory and will be used as a main 
tool in this course. The proof of the next theorem is a classic example of 
the use of this technique. To formulate it we need the following definition. 

Let ( A , <) and (B, <} be partially ordered sets. A function f:A—>B 
is an increasing function if x < y implies f{x) < f(y) for every x,y £ A; 
f is strictly increasing if x < y implies f{x) < f(y). Similarly we define 
decreasing and strictly decreasing functions. 

Notice that if / : A — > B is strictly increasing then it establishes an 
order isomorphism between A and f[A], 

Theorem 4.1.8 If f : A — > A is a strictly increasing function defined on 
a well-ordered set A then x < f{x) for all x £ A. 

Proof Let B = {x £ A: x < f(x)}. We will show that B = A. Let x £ A 
be such that 0(x) C B. By Theorem 4.1.6 it is enough to show that x £ B. 
So take y £ 0(x). Then y < f(y). Moreover, f{y) < f(x), since y < x. 
Hence y < f(x) for every y £ 0(x). In particular, f(x) € A\ O(x). Since 
x is the first element of A \ O(x) we conclude that x < f(x). Thus x £ B. 
□ 

Corollary 4.1.9 If well-ordered sets A and B are isomorphic, then the 
isomorphism f : A — > B is unique. 

Proof Let g: A — > B be another order isomorphism. We will show that 
f = g. So let x £ A. The function g -1 o / : A — > A is also an isomorphism. 
Using Theorem 4.1.8 applied to g~ x o f we conclude that x < g -1 (/(a:)) 
and, applying function g to both sides, g{x) < f(x). Similarly we argue for 
f(x ) < g(x). Thus f(x) = g{x). Since x was arbitrary, / = g. □ 


Corollary 4.1.10 No well-ordered set is isomorphic to any of its proper 
initial segments. 

Proof If a well-ordered set A were isomorphic to an initial segment O(x) 
for x £ A, and /: A — * 0(x ) were an isomorphism establishing this fact, 
then we would have f(x) £ O(x), that is, f(x) < x, contradicting Theo- 
rem 4.1.8. □ 


Corollary 4.1.11 No two distinct initial segments of a well-ordered set 
are isomorphic. 
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Theorem 4.1.12 If A and B are well-ordered sets then either 

(i) A and B are isomorphic, 

(ii) A is isomorphic to a proper initial segment of B, or 

(iii) B is isomorphic to a proper initial segment of A. 

Proof Let 

Z = {x £ A: 3y x £ B ( 0(x ) and 0{y x ) are order isomorphic)}. 

Notice that if x £ Z then, by Corollary 4.1.11, y x from the definition of 
the set Z is unique. Define f: Z — > B by putting /( x) = y x , that is, 

/ = {{x,y) £ Ax B: 0(x) and O(y) are order isomorphic}. 

First we will show that Z is an initial segment of A. To see it, let 

x < z and z £ Z. We have to show that x £ Z. Since z £ Z , there is an 

isomorphism h between O(z) and 0(f(z)). The restriction of h to 0(x ) 
gives an isomorphism, whose range must be a proper initial segment S of 
0(f(z)), and, by Theorem 4.1.5, S = 0(y) for some y £ 0(f(z)). Thus 
x £ Z and we conclude that Z is an initial segment of A. 

This argument shows also that f is strictly increasing, since x < z and 
z £ Z imply that 0(x) is isomorphic to 0(y) for some y £ 0(f(z)), that 
is, f(x) = y< f{z). 

Similarly as for Z we can also prove that f[Z) is an initial segment of B. 
(Just note that f[Z\ = {y £ B: 3x £ A (0(y) and 0(x) are isomorphic)}.) 
So /: Z — > f[Z] is an order isomorphism. 

If Z = A then / establishes an isomorphism between A and either B or 
a proper initial segment of B. Similarly, if f[Z] = B then f~ 1 shows that 
B is isomorphic to either A or a proper initial segment of A. Thus, it is 
enough to prove that either Z = A or f[Z\ = B. 

To obtain a contradiction assume that Z ^4 A and f[Z\ yf B . Then Z 
is a proper initial segment of A and f[Z\ is a proper initial segment of B. 
Hence, by Theorem 4.1.5, Z = O(x) for some x £ A and f[Z\ = 0(y) for 
some y £ B. But then, / is an isomorphism between Z = O(x) and 0(y), 
contradicting the fact that x qL Z. □ 

We will finish this section with the following construction of an ordering 
relation on a product space. 

Theorem 4.1.13 Let {F t }t<=T he an indexed family of nonempty linearly 
ordered sets ( F t , < t ) and let T be well ordered by <. If A is a binary relation 
on the Cartesian product P = liter defined by the formula 

f A g O f = g V 3s (s = min{f € T : /(f) ± g(t)} & f(s) < s g(s)) 
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for every f,g £ P, then < is a linear order on P. 

Moreover, if T is finite then P is well ordered by ^ if and only if all 
sets F t are well ordered. 

The proof is left as an exercise. 

The relation ^ defined in Theorem 4.1.13 is called the lexicographic 
order of Ylter and is sometimes denoted by <i ex . 

EXERCISES 


1 Prove Proposition 4.1.3. 

2 Prove Proposition 4.1.4. 

3 Let /: X — > Y be an isomorphism between partially ordered sets ( X , <) 
and (Y, X). Show that S C X is a (proper) initial segment of X if and only 
if f[S] is a (proper) initial segment of Y. 

4 Complete the proof of Theorem 4.1.5 by showing that S = 0(£). 

5 Prove Theorem 4.1.13. 

6 Show that in the additional part of Theorem 4.1.13 the assumption that 
T is finite is essential by giving an example of well-ordered sets T and F 
for which the lexicographic order on F T is not a well-ordering relation. 

7 Let ( K , <) and ( X , <) be nonempty linearly ordered sets such that X has 
a fixed element 0 £ I. For /: K — » X let supp(/) = {k € K: f(k) ^ 0} 
and put 

F(K,X ) = {/ e X K : supp(/) is finite}. 

Define a binary relation -< on F{K,X) by putting 

f ^ g f = g or /(to) < g(m), where to = maxjfc £ K: f(k) ^ g(k)}, 
for every f,g £ F(K,X). Prove that 

(a) the relation ^ is a linear-order relation on F(K,X); 

(b) if X has at least two elements and 0 is the minimal element of X 
then {F(K,X),<) is well ordered if and only if (X,<) and (K,<) 
are well ordered. 

The relation ^ is usually called an antilexicographic- order relation. 
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4.2 Ordinal numbers 

In the previous section we were informally talking about the “order type” 
of a given partially ordered set (X, <). The natural way of thinking about 
such an object is to consider it as a representative from the equivalence 
class [X]< of all ordered sets order isomorphic to ( X , <). This equivalence 
class, however, is not a set! (The argument is similar to that for the class 
of all sets.) Moreover, if we want to talk about all order types, we find also 
that there are too many of them; that is, even if we discover a good way of 
choosing a representative from each equivalence class [X]<, the class of all 
such representatives will not be a set. 

One way to avoid this problem is to consider the sentences “(Y, <) has 
the order type r of (X, <)” and “(Y, <} and (X, <) have the same order 
type” just as abbreviations for the well-defined sentence “(Y, <) and (X, <) 
are order isomorphic.” That is the way we will think about the order types 
of arbitrary partially ordered sets. For the well-ordered sets, however, 
we can indeed follow the path of our intuition described in the previous 
paragraph. This section is dedicated to fulfilling this goal by constructing 
the canonical order types of well-ordered sets, called ordinal numbers. The 
construction is due to von Neumann (1929). 

The class of all ordinal numbers is still too big to be a set. Thus, instead 
of constructing it as a set, we will define a formula <p(a) that will represent 
an intuitive sentence “a is an ordinal number.” The definition is as follows. 

A set a is said to be an ordinal number (or just an ordinal) if it has 
the following properties: 

(A a ) If (3 £ a then j3 C a; 

(B a ) If /3 , 7 G a then /3 = 7 , /3 G 7 , or 7 G /3; 

(C a ) If 0 yf B C a then there exists a 7 € B such that 7 fl B = 0. 

It follows from Theorem 3.1.2 that if a stands for either any natural 
number or the set N of natural numbers then a satisfies conditions (A a ) 
and (B a ). Also, the well-ordering principle for the natural numbers implies 
condition (C a ) in this case. So every natural number is an ordinal number 
and so is N. It is also easy to see that the set N U {N} is an ordinal 
number. The sets N and NU{N}, when considered as ordinal numbers, are 
traditionally denoted as uj and u> + 1 , respectively. 

We will start with the following basic property. 

Theorem 4.2.1 If a is an ordinal number and rj £ a then rj is also an 
ordinal number. 

Proof Assume that a satisfies (A a )-(C a ) and let rj € a. We have to show 
that 77 satisfies (A^)-(C r) ). 
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First notice that (A Q ) implies 77 C a. To see (C^) let 0 yf B C 77. Then 
l/BCa and (C v ) follows from (C Q ). To see (B^) take /3,j G 77. Then 
(3, -y G a and (B^) follows from (B Q ). 

Finally, to see (A,,) let (3 G 77. We have to show that (3 C 77. So choose 
£ G (3. The proof will be finished when we show that £ G 77. But f3 G r] C a. 
So, by (A a ), (3 C a and £ G a, since £ € /3. We have proved that £,77 € a. 
Now, using (B a ), we conclude that £ € 77, £ = 77, or 77 € £. If £ € 77 then we 
are done. We will prove that the other two cases are impossible. 

To obtain a contradiction, assume first that 77 € £. Then, in particular, 
77 € £ € f3 € 77. Put 13 = {£,/?, 77}. Then I / B C a. But then (C a ) fails 
for this 13, since 77 € £nI3, £ € /3fli3, and /3 € 77 D -B, that is, the sets £fl 13, 
(3 D 33, and 77 D 13 are nonempty. 

So assume £ = 77. Then 77 = £ G (3 G 77. Choosing i3 = {/?, 77} gives a 
contradiction similar to that in the previous case. □ 

Now notice that for an ordinal number a and for arbitrary (3 , 7 G a 

7 C (3 if and only if 7 = (3 V 7 € (3. (4.3) 

The implication <= follows immediately from (A^), which holds by The- 
orem 4.2.1. To see the other implication let 7 C (3. Since (3 , 7 G a, condition 
(B q ) implies that 7 = (3, 7 G /3, or (3 G 7. If either of the first two condi- 
tions holds, we are done. But if (3 G 7 then, by (A 7 ), (3 C 7, and combining 
this with 7 C (3 we obtain 7 = f3. 

Theorem 4.2.2 If a is an ordinal number then the relation C is a well- 
ordering relation on a. 

Proof Inclusion C is clearly a partial-order relation on any family of sets, 
so it is on a. It is a linear order on a by (B a ) and (4.3). 

To see that a is well ordered by C, let 0 yf i3 C a and let 7 € 13 be 
as in (Cq.), that is, such that 7 n 13 = 0. We will show that 7 is minimal 
with respect to C. So choose arbitrary (3 € 13. Then, by (B a ) and (4.3), 
either 7 C (3 or (3 € 7. But (3 € 7 implies 7 ft i3 0, which is impossible. 
Thus 7 C (3. Since (3 was an arbitrary element of B we conclude that 7 is 
a minimal element in B. □ 

It is customary that the relation C on an ordinal number is denoted 
by <. It is also easy to see that, according to our general agreement, the 
relation < on an ordinal number is identical to G. 

Notice also that by Theorem 4.1.5 every proper initial segment of an 
ordinal number a is of the form 0(/3) for some (3 G a and that 0(/3) = (3. 
Thus every initial segment of an ordinal number is an ordinal number. 

The basic properties of ordinal numbers are as follows. 



46 


4 Well orderings and transfinite induction 


Theorem 4.2.3 For every ordinal numbers a and [3 

(i) if a and j3 are order isomorphic then a = (3; 

(ii) a = (3 , a < (3 , or j3 < a; 

(iii) if T is a nonempty family of ordinal numbers then (J F is also an 
ordinal number. 

Proof (i) Let f: a — * (3 be an order isomorphism and let 

Z = { 7 G a: /( j) = 7}. 

We will show, by transfinite induction, that Z = a. This will finish the 
proof, since Z = a implies that / is the identity function on a, and thus 
a = f[a] = (3. 

To see that Z = a let 7 € a be such that 0( 7) C Z. Then f[0( 7)] = 
0(f( 7)) and f(S) = 6 for all S < 7. Hence 

/( 7) = 0(f( 7)) = f[0( 7)] = {/(<$): 8 < 7} = {8: 8 < 7} = 7. 

Thus we have proved that for every 7 £ a condition 0(7) C Z implies that 
7 £ Z. Hence Z = a. Condition (i) has been proved. 

(ii) By Theorem 4.1.12 either a and (3 are isomorphic, or a is isomorphic 
to a proper initial segment of (3, or (3 is isomorphic to a proper initial 
segment of a. By (i), this implies (ii). 

(iii) We have to check that 8 = satisfies (Ag)-(Cg). 

(A$): If (3 € IJ-A then there is an a € T such that (3 € a. Then 
(3 C a C |J T. 

(B$): If /?,7 € then there are 07,0:2 € T such that (3 £ a 1 and 
7 € 02- By (ii) either oi < 02 or 02 < ot\. Let a be the greater of these 
two. Then /1,76 a and, by (B a ), (3 = 7, (3 £ 7, or 7 G (3. 

(Ca): Let 0 yf B C \j3F and let a € B. Then there is an p G J 7 such 
that a £ 77. Hence, by Theorem 4.2.1, a is an ordinal. If a D B = 0 put 
7 = o. Otherwise, put 7 = min(o O H), which exists by (C Q ). It is easy to 
see that 7 n B = 0. □ 

It is not difficult to see that for an ordinal number a the set a U {o} is 
also an ordinal number. We will denote it by o + 1 and call it the (ordinal 
immediate) successor of a. Number a is also called the (ordinal immediate) 
predecessor of o+l. It is also easy to check that o+l is the smallest ordinal 
number greater than o. Thus every ordinal number has its successor. Not 
every number, however, has an immediate predecessor; for example, 0 and 
u> do not have one. The ordinal numbers that have immediate predecessors 
(i.e., those in the form a + 1) are called ordinal successors. Those that do 
not have immediate predecessors are called limit ordinals. 
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The next theorem justifies our intuition of considering ordinal num- 
bers as representatives of the abstract classes of all order-isomorphic well- 
ordered sets. 

Theorem 4.2.4 For every well-ordered set (W, <) there exists precisely 
one ordinal number a that is order isomorphic to IT. 

Proof The uniqueness of a follows immediately from Theorem 4.2.3(i). 
For w £ IT let 0[w\ = 0(w ) U {ic} and define 

Z = {w £ IT : 0[ie] is order isomorphic to some ordinal number a w }. 

Notice that by Theorem 4.2.3(i) and Corollary 4.1.9 the ordinal number 
a w and the isomorphism f w : 0[w\ — > a w are unique. 

Now, if v,w £ Z and v < w then f w [0[v]\ is an ordinal number a, 
as an initial segment of a w . In particular, f w \o[ v ] '■ 0\v\ — > a is an order 
isomorphism and, by the preceding uniqueness remark, f u ,\o[v = fv So 
we have shown that 

fv C f w for every v, w £ Z, v < w. (4-4) 

We will prove, by transfinite induction, that Z = W. 

So let 0(w) C Z. Then, by (4.4), / = U«eo(iu) f v a strictly increasing 
function from LUo(™) ont o U„ £ o(») a v Bu t U u6 o(^) °i v \ = °0) 
and, by Theorem 4.2.3(iii), a = U„go(u;) av i s an ordinal number. So 
/: 0(w) — > a is an order isomorphism. Extend / to a function F: 0[u>] — » 
a + 1 by putting F(w) = a. Then the existence of F proves that w £ Z. 
We have shown that Z = W. 

Now, by (4.4), the function / = f w i s an isomorphism between 

W = \J weW 0[w] and an ordinal number {J w( , w a w . □ 

In what follows it will be convenient to use the following theorems. 

Theorem 4.2.5 If a and (3 are ordinal numbers and f: a — > (3 is a strictly 
increasing function then a < (3. Moreover, £ < /(£) for every f < a. 

Proof If (3 < a then (3 C a and / is a strictly increasing function from a 
into a. But then f(/3) £ (3, that is, f{(3) < (3, contradicting Theorem 4.1.8. 
So, by Theorem 4.2.3(h), a < (3. 

The additional part follows from Theorem 4.1.8. □ 

For a well-ordered set W let Otp(W) stand for the order type of W, 
that is, the unique ordinal number that is order isomorphic to IT. 

Corollary 4.2.6 IfW is well ordered and B C IT then Otp (B) <Otp(IT). 
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Proof Let /: Otp(B) — > B and g: W — » Otp(IT) be order isomorphisms. 
Then go/: Otp(S) — > Otp(W) is strictly increasing, and by Theorem 4.2.5 
Otp (B) < Otp(W). □ 

We will also introduce the following arithmetic for ordinal numbers. For 
ordinal numbers a and (3 we define the sum a + (3 of a and (3 as the order 
type of the well-ordered set ({0} x a) U ({1} x (3) ordered by 

(*> 0 V (i = j & £ < <)■ 

Thus, we “append” (3 to a. It is easy to see that the relation so defined is 
indeed a well-ordering relation. 

Similarly, we define the product a/3 of ordinal numbers a and f3 as the 
order type of the set (3 x a ordered lexicographically. By Theorem 4.1.13 
it is well ordered. 

It is not difficult to see that, in general, a + (3 ^ (3+ a and a/3 yf (3a (see 
Exercise 1). We have the following monotonic laws for these operations. 

Theorem 4.2.7 For arbitrary ordinal numbers a, /3,7 

(a) a < (3 implies 7 + a < 7 + (3; 

(b) a < (3 implies a + 7 < (3 + 7; 

(c) a < (3 and 7 > 0 imply 7a < 7 (3; 

(d) a < (3 implies 07 < (3j. 

Moreover, the inequalities < in (b) and (d) cannot be replaced by < even 
if a < (3. 

Proof To see (a) let /: ({0} x 7)U({1} x a) — > ({0} x 7) U({1} x (3) be the 
identity map. Notice that / is an isomorphism between ({0} x 7)U({1} x a) 
and the initial segment 0((1, a)) of ({0} x 7) U ({1} x (3). Thus 7 + a is 
isomorphic to a proper initial segment of 7 + (3, so 7 + a < 7 + (3. 

To see (b) let /: ({0} x a) U ({1} x 7) — » ({0} x (3) U ({1} x 7) be the 
identity map. Notice that / is strictly increasing. Thus, via an appropriate 
isomorphism, / can be transformed into a strictly increasing function from 
a + 7 into (3 + 7. But then, by Theorem 4.2.5, a + 7 < (3 + 7. 

To see that equality can hold even if a < (3, notice that 1 +co = u> = 0+o> 
(see Exercise 1). 

Parts (c) and (d) are left as exercises. □ 

For an ordinal number a any function a on a is called a transfinite 
sequence and is usually denoted by or (a^: f < a), where oj = 

a(£). If a is a domain of a transfinite sequence a then we also often say 
that a is an a-sequence. 
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EXERCISES 

1 Show that for ordinal numbers a and (3 we might have a + /3//3 + a 
and a/3 y (3a by proving that 

(a) l+w = w,sol + w/ w+ l; 

(b) 2u> = u>, lu 2 = u + u), and w/w + w. 

2 Prove parts (c) and (d) of Theorem 4.2.7. 

4.3 Definitions by transfinite induction 

Most mathematicians take for granted that if we have a procedure asso- 
ciating a number a n +\ to a number a n and we have defined ao, then the 
sequence {a n }neN constructed in such a way exists. The proof of this fact 
for a finite portion {a n } n < m , m € N, of such a sequence does not require 
any special axioms. The proof of existence of the infinite sequence {a„} n6 N, 
however, requires some form of argument, and the axiom of choice is needed 
to prove any procedure justifying such an argument. (For example, if we 
stay only within the framework of Peano arithmetic, we have no way to 
prove the existence of an infinite object {a n }neNj since we operate in a 
world with finite objects only.) This argument is given in the next theo- 
rem. 

Theorem 4.3.1 (Recursion theorem) Let Z be a set, a be an ordinal num- 
ber, and T be the family of all £- sequences for C < a with values in Z, that 
is, T = U^< a ■ Then for each function h: T — > Z there exists precisely 
one function f: a — > Z such that 

/( 0 = M/ld f or aU £<(*■ (4-5) 

The function h from the theorem tells us how to choose /(£) knowing 
the values of the sequence (/(C) : C < C) chosen so far. Because of this, we 
will sometimes call it an “oracle” function. 

Proof of Theorem 4.3.1 First we prove the uniqueness of /. So let g 
be another function satisfying (4.5) and let W = {£ < a\ /(£) = g(£)}- 
It is enough to show that W = a. But if £ = 0(C) C W then / 1| = g\$ 
and /(C) = h(f ||) = h(g |{) = g( /)■ Hence C € W, and by a transfinite 
induction argument W = a. The uniqueness of / has been proved. 

To prove the main body of the theorem assume, to obtain a contradic- 
tion, that the theorem fails for some ordinal number a. We may assume 
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that a is the smallest ordinal number for which the theorem fails, since if 
it is not, we can replace it by the minimum 

min{/3 < or. theorem fails for f3}. 

Let h be a function for which the theorem fails for ordinal a. For every 
£ < a there is a function : £ — » Z such that 

fd'y) = M/il 7 ) for a11 7 < £• (4-6) 

But for any £ < £ < a the function also satisfies (4.6). Thus, by the 
uniqueness part that we have already proved, ,/yd = ft- particular, 

fd'y) = fd'y) for ever y 7 < £ < C < «■ 

Now, if a is a successor ordinal, say a = (3 + 1, then define / : a — > Z 
as an extension of fp by putting f(/3) = h(fp). Then / satisfies (4.5) since 
/I? = fp\t ^ or ever y £ < ot. This contradicts the choice of a. 

Thus assume that a is a limit ordinal. Then £ + 1 < a for every £ < a. 
Define / : a — > Z by 


/(£) = /«+i(£) for £ < a 
and notice that such a function satisfies (4.5), since 

/(£) = /«+i(£) = M/f+ik) = M/lc) 

for every £ < a. This contradicts the choice of a. □ 

Most commonly, the oracle function h is defined as a choice function C 
on some family of sets. In such a case a particular value /t(£) is chosen as a 
value of C on some set, whose definition depends on the previously chosen 
values of /, that is, it is of the form 

M P) = C d z e <P(2,P)}), 

where C: V(Z) — » Z is a choice function and tp(z,p) is some formula. Of 
course, such a function h is not defined for the values of p £ T for which 
the set {z £ Z : <p(z,p)} is empty. Formally, to use the recursion theorem 
we have to define it for these values of p as well; however, in most cases we 
are not interested in such values anyway. Thus, we usually assume that h 
is arbitrarily defined somehow for such values and we don’t even bother to 
mention it. (The same is true for the other values for the oracle function 
in which we are not interested.) Also, it is customary in such situations to 
simply write that we choose in the inductive step 


/(£) G {z £ Z: <p{z,f\d} 
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or simply 

/(£) = z such that 2 satisfies ip(z, /|^) 

without mentioning explicitly the oracle function at all. 

In the applications using the recursion theorem we often separately 
specify the value of the function h for the empty sequence by h(0) = A and 
for successor ordinals £ + 1 by h(<j>) = F(<j>(£)) for some function F £ Z z , 
where f> £ Z^ + K Thus the value of li on successor ordinal £ + 1 depends 
only on the value </>(£) of </> on its last element f. The a-sequence / obtained 
that way satisfies the conditions 

/( 0) = A, /(£ + 1) = F(f(0) for any ordinal £ < a, 

/(A) = h(f\ x ) for any limit ordinal A < a. 

Notice that to define the function / only on a = oj we don’t need the 
very last condition. Then the first two conditions can be rewritten as 

/(0) = A, f(n+l) = F(f(n)) for any n<u. 

The existence of such a sequence is usually deduced from the standard (not 
transfinite) recursion theorem. 

There is probably no need to explain to the reader the importance of 
this last theorem. Without this theorem it is sometimes hard to prove even 
very intuitive theorems. As an example we can state here the following 
characterization of well orderings. 

Theorem 4.3.2 A linearly ordered set ( X , <} is well ordered if and only 
if it does not contain a subset of order type co* (i.e., a strictly decreasing 
uj-sequence). 

Proof =*-: If there exists a strictly decreasing sequence (x n : n < w) then 
the set {x n \ n < uj} does not have a smallest element. 

<=: Assume that X is not well ordered and construct, by induction on 
n < u>, a strictly decreasing sequence (x n : n < to). The details are left as 
an exercise. □ 

The next theorem is one of the most fundamental theorems in set theory. 

Theorem 4.3.3 (Well-ordering or Zermelo’s theorem) Every nonempty 
set X can be well ordered. 

Proof Let C be a choice function on the family V(X) \ {0}. Choose an 
arbitrary p ^ X, for example, p = X, and define F : V(X U {p}) — > X U {p} 

by 

F{Z ) = [ C{Z) for z e V(X) \ {0}, 

1 p otherwise. 



52 


4 Well orderings and transfinite induction 


Let Q be the family of all ordinal numbers / such that there is a one- 
to-one ^-sequence in X. Notice that Q is a set by the replacement axiom, 
since 

Q = {Otp((Y, <)): Y £ V(X) & <€ V(X x X) & (Y, <) is well ordered}. 

Let a be an ordinal number that does not belong to Q. For example, 
take a = (U f?) + 1. By the recursion theorem there exists a function 
/ : ft-tlU {p} such that 

m = F(X\{f{Q: C<C» (4-7) 

for all £ < a. (The function h is defined by h(s) = F(X\range(s)).) Notice 
that for every (3 < a 

if f[/3\ C X then /Ip is one-to-one. (4.8) 

This is the case since, by (4.7), /(£) € X \ {/(C): C < C} f° r C < (3- In 
particular, f[a] <f. X since there is no one-to-one a-sequence in X. So the 
set S = {(3 < a: f((3) = p} is nonempty. Let (3 = min S'. Then / |p is 
one-to-one and it establishes a well ordering of X of type (3 by 

x S y & f^ 1 (x) < f^ 1 (y). □ 

Remark It is clear that we used the axiom of choice in the proof of The- 
orem 4.3.3. It is also easy to prove in ZF that Theorem 4.3.3 implies the 
axiom of choice. 1 To argue for this it is enough to show that Theorem 4.3.3 
implies the existence of a choice function for every family T of nonempty 
sets. But if < is a well ordering of (J T then the function f(F ) = min F 
for F £ T is a choice function defined only with help of the comprehension 
schema. 

The combination of the recursion and well-ordering theorems gives a 
very strong proving technique. However, for its use usually some kind of 
cardinal argument is needed, a tool that we still have not developed. Thus, 
the direct use of this technique will be postponed until subsequent chapters. 
On the other hand, the next theorem will give us a way to prove the results 
that require some kind of transfinite induction argument without doing it 
explicitly. To state it we need the following definitions. 

For a partially ordered set (P, <) we say that a set S C P is a chain in 
P if S is linearly ordered by <. An element b € P is said to be an upper 
bound of a set S C P if s < b for every s £ S. Similarly, b £ P is a lower 
bound of S C P if b < s for every s £ S. 

1 ZF stands for the ZFC axioms from which the axiom of choice AC has been removed. 
See Appendix A. 
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Notice that not all subsets of a partially ordered set must have upper 
or lower bounds. For example, (0, oo) C R does not have an upper bound 
and every number r < 0 is a lower bound of (0, oo). 

Let us also recall that m £ P is a maximal element of (P, <) if there is 
no p £ P such that to < p. 

Theorem 4.3.4 (Zorn’s lemma 2 ) If (P, <) is a partially ordered set such 
that every chain in P has an upper bound then P has a maximal element to. 

Proof The proof is similar to that of Theorem 4.3.3. Let C be a choice 
function on the family P(P)\{0}. Choose an arbitrary x ^ P, for example, 
x = P, and define F : P(P U {x}) — » P U {x} by 

F(Z)=f C(Z) forZe? (P)\{ 0 }’ 

1 x otherwise. 

Let Q be the family of all ordinal numbers £ such that there is a strictly 
increasing sequence {pc '■ C < 0 i n P- Let a be an ordinal number that 
does not belong to Q. For example, take a = ((J Q) + 1. 

Define a function h on the family of all ^-sequences (p^ '■ C < 0 in 
P U {x}, £ < a, by putting 

H(p C :C<0)=HB\{p C :C<f}), (4.9) 

where B = {b £ P: b is an upper bound of {p^: ( < £}}. (Notice that 
B = 0 if {p^: <C < (£_ P.) By the recursion theorem, there is a function 

/ such that 

f(0 = Hfk) for all £ < a. 

Notice that for every /3 < a 

if f\j3\ C P then f\p is strictly increasing. (4-10) 

This is the case, since then, by (4.9), /(£) £ B \ {p^ : ( < for £ < /3. In 
particular, f[a] fL P since there is no strictly increasing a-sequence in P. 
So the set S = {/3 < a: f{(3) = x} is nonempty. Let (3 = min S'. Then f\p 
is strictly increasing, so B = {b £ P: b is an upper bound of {p^: ^ < /?}} 
is nonempty, {p^ : £ < (3} being a chain. Let to £ B. It is enough to argue 
that to is a maximal element of P. But if there is a y £ P with y > to, 
then y > m > p£ for every £ < j3. In particular, y £ B \ {p^: £ < /?}, so 

2 M. Zorn proved this lemma in 1935 and published it in Bulletin AMS. The same theo- 
rem was proved in 1922 by K. Kuratowski and published in Fundamenta Mathematicae. 
Thus priority for this theorem belongs without any doubt to Kuratowski. However, in 
essentially all published sources the name Zorn is associated to this theorem and it seems 
that the battle for historical justice has been lost. 
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B \ {pj: £ < /?} 7 ^ 0. Thus, by (4.9), f((3) G P , contradicting the choice 
of /?. ' □ 

Zorn’s lemma is used most often in the situation when P is a family of 
subsets of a set A ordered by inclusion and such that for every chain S C P 
its union IJ S is also in P. (Notice that (J 5 is an upper bound for S with 
respect to C.) In this particular case Zorn’s lemma has an especially nice 
form, stated as the following corollary. 

Corollary 4.3.5 (Hausdorff maximal principle) If T is a nonempty family 
of subsets of a set A such that for every chain S C T its union (J S belongs 
to T then T has an C-maximal element. 

Remark Zorn’s lemma and the Hausdorff maximal principle are equivalent 
to the axiom of choice within the ZF theory. To see this first notice that 
Zorn’s lemma follows from the axiom of choice (Theorem 4.3.4) and that the 
Hausdorff maximal principle follows from Zorn’s lemma (Corollary 4.3.5). 
To see that the Hausdorff maximal principle implies the axiom of choice 
let Q be a nonempty family of pairwise-disjoint nonempty sets. We have 
to find a selector S for Q. So let T be the family of all T C (J Q such that 
TtlG has at most one element for every G G Q. Notice that T satisfies the 
assumptions of the Hausdorff maximal principle. Let S G T be maximal. 
We will show that S' is a selector for Q. But if S is not a selector for Q 
then G D S = 0 for some G G Q. Then there is a g & G and S U {</} is in 
J- . contradicting the maximality of S. 

EXERCISE 

1 Complete the details of the proof of Theorem 4.3.2. 

4.4 Zorn’s lemma in algebra, analysis, and 
topology 

In this section we will see three standard applications of Zorn’s lemma in 
three main branches of mathematics: algebra, analysis, and topology. Each 
of these areas will be represented, respectively, by the theorem that every 
linear space has a basis, the Hahn-Banach theorem, and the Tychonoff 
theorem. Going through these proofs should help the reader to appreci- 
ate the power of Zorn’s lemma and, implicitly, the axiom of choice. The 
Hahn-Banach theorem and the Tychonoff theorem will not be used in the 
remaining part of this text. 

Algebraic application To state the algebraic example let us recall the 
following definitions. A set G with a binary operation + : G x G -> G is a 
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group if + is associative (i.e. , (u + v) + w = u + (v + w) for all u, v, w £ G), 
G has the identity element 0 (i.e., 0 + u = u + 0 = i>for every v £ G), and 
every element of G has an inverse element (i.e., for every v £ G there exists 
— v £ G such that (—v) + v = v + (—v) = 0). A group ( G , +) is Abelian if 
a + b = b + a for every a,b £ G. A set K with two binary operations + 
and • on K is a field, if ( K , +) is an Abelian group with 0 as an identity 
element, (A'\{0}, •) is an Abelian group with 1 as an identity element, and 
a{b + c) = ab + ac for every a,b,c £ K . In what follows we will mainly be 
concerned with the fields (R, +, •) of real numbers and (Q, +, •) of rational 
numbers. 

An Abelian group (V, +) is said to be a linear space (or vector space) 
over a field K if there is an operation from K x V to V, ( k , v) — > kv, such 
that ( k + l)v = kv + Iv, k(v + w) = kv + kw, k(lv ) = ( kl)v , Ov = 0, and 
Iv = v for every k, l £ K and v, w £ V. A subset S' of a vector space V over 
a field K is a linear subspace of V if it is a linear space when considered 
with the same operations. The main examples of vector spaces considered 
in this text will be R" over either R or Q. 

A subset S of a vector space V over a field K is linearly independent 
if for every finite number of distinct elements V \ , . . . ,v n of S and every 
ki, . . . ,k n £ K the condition k\V\ + • • • + k n v n = 0 implies k\ = ■ ■ ■ = k n = 
0. A subset S of a vector space V over a field K spans V if every v £ V 
can be represented as v = k\V\ + • • • + k n v n for some v\,. . . ,v n £ S and 
ki, . . . ,k n £ K. A basis of V is a linearly independent subset of V that 
spans V. 

It is easy to see that if I? is a basis of V then every v £ V has a unique 
representation v = k\V\ + • • • + k n v n , where V\, . . . ,v n £ B are different 
and fci, . . . ,k n £ K (and where we ignore the i>jS for which ki = 0). This is 
the case since if v = l\V\ + • • • + l n v n is a different representation of v, then 

(h - k\)v\ H 1- (l n — k n )v n = v - v = 0 so h - k\ = ■ ■ ■ = l n - k n = 0. 

Thus U = ki for i £ {1, . . . , n}. 


Theorem 4.4.1 If So is a linearly independent subset of a vector space V 
over K then there exists a basis B of V that contains Sq. 

In particular, every vector space has a basis. 

Proof The additional part follows from the main part, since the empty 
set is linearly independent in any vector space. 

To prove the main part of the theorem, let 

tF = {S C V : So C S and S is linearly independent in V}. 

Notice first that T satisfies the assumptions of the Hausdorff maximal 
principle. 
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Clearly T is nonempty, since So £ F . To check the main assumption, 
let Q C T be a chain in T with respect to C. We will show that [JQ is 
linearly independent in V. So let V \ , . . . , v n be different elements of (J Q 
and choose k\, . . . ,k n £ K such that k\V\ + ■ ■ ■ + k n v n = 0. For every 
i S {1,... ,n} let Gi £ Q be such that ry £ G, . Since {G i, . . . , G„} is a 
finite subset of a linearly ordered set Q , we can find the largest element, 
say Gj, in this set. Then ry £ Gi C Gj for all* £ {1, . . . , n}. Hence all zys 
are in a linearly independent set Gj. Thus k\V\ + • • • + k n v n = 0 implies 
k\ = ■ ■ ■ = k n = 0. So we can use the Hausdorff maximal principle on T . 

Let B be a maximal element in T . We will show that it is a basis of V. 
It is linearly independent, since it belongs to T . So it is enough to prove 
that B spans V . 

To obtain a contradiction assume that there is a v £ V such that 


v ^ k\v i H 1- k n v n (4-11) 

for every v\,...,v n £ V and ki , . . . , k n £ K. We will show that this 
implies that B U {z?} is linearly independent in V, which contradicts the 
maximality of B in T . So choose different elements v \ , . . . ,v n from B and 
fc 0 ,... ,k n £ K such that kov + k\V\ + ■ ■ ■ + k n v n = 0. There are two cases. 

Case 1: ko = 0. Then k\V\ + • • • + k n v n = 0 and V\, . . . , v n are from B. 
Hence, also k\ = • • • = k n = 0. 

Case 2: ko ^ 0. Then v — — (ki/ko)v\ + ■ ■ ■ — ( k n /ko)v n , contradicting 
(4.11). □ 

Analytic application For the next theorem the term vector space will be 
used for the vector spaces over the field R of real numbers. Recall that a 
function / is said to be a linear functional on a vector space V if / : V — > R 
is such that f(a x + by) = af(x) + bf(y) for every x, y £ V and a, b £ R. 

Theorem 4.4.2 (Hahn-Banach theorem) Let V be a vector space and 
p: V — > R be such that p(x + y) < p(x) + p(y) and p(ax) = ap(x) for 
all x, y £ V and a > 0. If f is a linear functional on a linear subspace S of 

V such that f(s) < p(s) for all s £ S, then there exists a linear functional 
F on V such that F extends f and F(x) < p(x ) for all x £ V. 

Proof Let T be the family of all functionals g on a linear subspace T of 

V such that g extends / and g(x) < p(x) for all x £ T . First we will show 
that T satisfies the assumptions of the Hausdorff maximal principle. 

So let Q C T be a chain in T with respect to C. We will show that 
(J Q £ T. Clearly / C (J Q. To see that g = (J Q is a functional, notice first 
that g is a function since for (x,So), (x,s i) £ g we can find go,gi € Q such 
that ( x,Si ) £ gi for i < 2. Then, for go C gi, So = go(%) = ffi(x) = si- 
Similarly, if g 1 C go then si = gi(x) = g 0 (x) = s 0 . 



4-4 Zorn’s lemma in algebra, analysis, and topology 


57 


In a similar way we show that the domain of g is a linear subspace and 
that g is a linear functional on its domain. Since also g(x) < p{x) for all 
x £ dom(g), we conclude that g £ T, that is, T satisfies the assumptions 
of the Hausdorff maximal principle. 

Now let F: T — > R. be a maximal element of T . Then F is a linear 
functional extending / such that F(x) < p(x) for every x £ T. It is enough 
to prove that T = V. 

To obtain a contradiction assume that V / T and let w £ V\T. We will 
show that this contradicts the maximality of F by constructing a G £ T 
that extends F and is defined on a linear subspace Vo = Span(T U {w}) = 
{aw + t: a el, tsTjofV. 

First notice that for every s,t £ T 

F(s ) + F(t) = F(s + t) < p(s + 1) = p(s — w + t + w ) < p(s — w) + p(t + w ) 
and so 

— p(s — w) + F(s) < p(t + w) — F(t). 

Hence, in particular, 

sup[— p(t — w) + F(t )] < inf \p(t + w) — F(t)]. 
ter 

Choose a such that sup [—p(t — w) + F(t)] < a < inf \p(t + w) — F{t )]. Then 
a < inf[p(t + w) — F(t)] & — a < inf[p(t — w) — F(t)]. 

(4.12) 

Now for a £ R. and t £ T define 

G(aw + t) = aa + F(t). 

Clearly G is a functional on Vo that extends F. We have to show only that 
G(s) < p(s) for all s £ Vo- So let s = aw + t with t £ T, a £ R. 

If a = 0 then G{aw + t) = F{t) < p(t) = p(aw + t). 

If a > 0 then, by the first part of (4.12), 

G(aw + 1) = aa + F(t) 

= a[a + F(t/a )] 

< a[[p{t/a + w) — F{t/a)]+ F{t/a)] 

= ap(t/a + w) 

= p(t + aw). 
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If a = — b < 0 then, by the second part of (4.12), 


G(aw + t) 


■ 


— ba + F(t) 
b[—a + F(t/b)\ 

b[\p(t/b - w ) - F(t/b)] + F{t/b)\ 
bpft/b — w) 
pit + aw). 


This finishes the proof of Theorem 4.4.2. □ 

Topological application Let us start by recalling some definitions. A 
family T of subsets of a set X is said to be a cover of a set A c X if 
A C (J T . If T is a cover of A then a family Q C T is said to be a subcover 
of T if A c U S- If ( X , t) is a topological space then T is said to be an 
open cover oi A c X \i A c\J F an d X C t. A topological space ( X , r) is 
compact if every open cover of X has a finite subcover. 

For a topological space ( X , r) a family B C r is said to be a base for 
X if for every U £ r and every x £ U there exists a B £ B such that 
x £ B C U. A family S C r is said to be a subbase of topological space 
(X, t) if the family 


B(S) = l p| Si G V{X ) : n e N & Vk n (S,: 6 5) 
li<n J 

forms a base for X. It is easy to see that for any family S of subsets of a 
set X the family 

r(5) = {U?:gc8(5)} 

forms a topology on X. This topology is said to be generated by S. The 
family S is also a subbase for the topology T(«S). 

Finally, let us recall that for a family {(X a ,T a )} ae j^ of topological 
spaces their Tychonoff product is a topological space defined on the product 
set X = rioeA X a with topology generated by the subbase 

{ Pa 1 (U) : a G AX.U G r Q }, 

where p a : X — > X a is the projection of X onto its ath coordinate X a . 

Theorem 4.4.3 (Tychonoff theorem) The product of an arbitrary family 
of compact spaces is compact. 

The proof will be based on the following lemma. 



4-4 Zorn’s lemma in algebra, analysis, and topology 


59 


Lemma 4.4.4 (Alexander subbase theorem) If there is a subbase S of a 
topological space X such that every open cover V C S of X has a finite 
subcover then X is compact. 

Proof Let S be a subbase of X such that any cover V C S of X has a 
finite subcover. Let B = B(S ) be a base for X generated by S and, to 
obtain a contradiction, assume that X is not compact. Then there is a 
subfamily U 0 of B(S ) that covers X but does not have a finite subcover. 
(We use here an elementary fact that for any base B of a topological space 
X, the space X is compact if and only if every cover U C B of X has a 
finite subcover.) 

Define T as 

jw C B: U 0 C U, X C u IA , and there is no finite subcover V C IA of X j-. 

To see that T satisfies the assumptions of the Hausdorff maximal principle 
let Q C T be a chain. If T C (J Q is finite then there is a U £ Q such that 
T C IA. So T cannot cover X, since IA £ IF. Thus (J Q does not have a 
finite subcover, that is, (J Q € T . 

So, by the Hausdorff maximal principle, there is a maximal element 
U £ T. Now let U £ IA. Then U = C\ i<n Vi for some U t £ S. We claim 
that 


there is a j < n such that Uj £ U. (4-13) 

Otherwise, for every i < n we can find a finite Ui C U such that U, U {Ui} 
covers X, that is, Ui covers X\Ui. But then (J i<n Ui covers U;<n(^W*) = 
X \ f] i<n Ui = X \ U. So {J i<n Ui U{[/} C W is a finite subcover of X, 
contradicting the fact that U £ T . 

But (4.13) implies that for every U £U there is a V = Uj £UC\S such 
that U C V. So V = UCiS is also a cover of X. Hence, by our assumption, 
V has a finite subcover of X. This contradiction finishes the proof. □ 

Proof of the Tychonoff theorem Let 

S = {p„ 1 (Cf) : a £ A & U £ T a } 

be a subbase of X = Yl ae j[X a and let U C S be a cover of X. By the 
Alexander subbase theorem it is enough to find a finite subcover of U. For 
a £ A let 


U a = {U C X a : U is open in X a and p a 1 (t/) £ Uj. 

If for some a £ A the family U a covers X a then, by the compactness 
of X a , there is a finite subcover V C U a that covers X a . But then 
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U' = {pf l {U)\ U G V} C U is a finite subcover of X. So, to obtain a 
contradiction, assume that no U a covers X a and let x(a) G X a \ (J U a for 
every a € A. Then x G X \ \J ae ^ (J{Pa 1 (U ) : U G U a } = X \ (J U. This 
finishes the proof. □ 

EXERCISES 

1 Prove that every partial-order relation^ on a set X can be extended to 
a linear-order relation < on X. (Here < extends A if Ac<.) 

2 We say that a subset A C R is algebraically independent if for every 
nonzero polynomial p{x\, . . . ,x n ) of n variables with rational coefficients 
and any sequence a±, . . . ,a n of different elements from A, p(a±, . . . , a n ) ^ 
0. Show that there exists an algebraically independent subset A of R such 
that if Q(-4) is a field generated by Q and A then for every b G R \ Q(^4) 
there exists a nonzero polynomial p(x) with coefficients in Q(H) such that 
p(b) = 0. (A family A with this property is called a transcendental basis of 
R over Q.) 

3 A filter on a set X is a nonempty family T of subsets of X such that 
(1) A fl B G T provided A,B G T\ and (2) if A C B C X and A G T then 
B G T . A filter T on X is proper if T V(X), and it is prime if for every 
A C X either AgT or X\AgT. Show that every proper filter on a set 
X can be extended to a proper prime filter. 

4 A graph on a (finite or infinite) set V is an ordered pair {V. E) such that 
E C \V] 2 , where [ V ] 2 is the set of all two-element subsets of V . For Eq C E 
a graph (V, E 0 ) is a forest in graph (V) E) if it does not contain any cycle, 
that is, if there is no sequence vo,Vi,... ,v n = Vq with n > 2 such that 
{u,;,Ui + i} G E 0 for * G n. For E 0 C E a graph ( V,E 0 ) spans (V,E) if for 
every v G V there is a w G V such that {v, rc} G Eq. 

Let (V, E) be a graph that spans itself. Show that there exists a forest 
(V,Eq) that spans (V,E). 



Chapter 5 

Cardinal numbers 


5.1 Cardinal numbers and the continuum hy- 
pothesis 

We say that the sets A and B have the same cardinality and write Am B 
if there exists a bijection /: H — > B. 

It is easy to see that for every A, B , and C 


• Am. 

4; 


• if Ap 

s B then B ' 

« A; and 

• if At 

s B and B » 

i C then A 


Thus the “relation” m of having the same cardinality is an equivalence “re- 
lation.” We put the word relation in quotation marks since our definition 
does not specify any field for «. This means that the “field” of this “rela- 
tion” is the class of all sets. However, the restriction of « to any set X is 
an equivalence relation on X. Thus we will use the term relation for m in 
this sense. 

By Zermelo’s theorem (Theorem 4.3.3) and Theorem 4.2.4, for every set 
A there exists an ordinal number a such that Am a. The smallest ordinal 
number with this property is called the cardinality of A and is denoted by 
|H|. Thus 


|H| = min{a: a is an ordinal number and A m a}. 

In particular, A m \A\ for every A. It is not difficult to see that A m B if 
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and only if |A| = \B\. Thus the terminology “sets A and B have the same 
cardinality” can be used for A « B as well as for |A| = \B\. 

Notice also that 

|a| < a for every ordinal number a, 

since a ss a and |a| is the smallest ordinal (3 such that a ss (3. This implies 
also that 

|a| < |/3| for all ordinal numbers a < / 3. 

An ordinal number k is said to be a cardinal number (or just a cardinal) 
provided n = |A| for some set A. We can distinguish the cardinal numbers 
from all other ordinal numbers by using the following properties. 


Proposition 5.1.1 For an ordinal number a the following conditions are 
equivalent: 

(i) a is a cardinal number; 

(ii) | a | = a; 

(iii) j3 <\a\ for every f3 < a; 

(iv) \j3\ < |a| for every j3 < a; 

(v) \j3\ ^ |a| for every (3 < a. 

Proof (i)=>(ii): If a is a cardinal number then there exists a set A such 
that |A| = a. This means, in particular, that A ss a, that is, |A| = |a|. So 
|a| = |A| = a. 

(ii) =>(iii): It is obvious. 

(iii) =>(iv): It follows from the fact that \(3\ < (3 for every ordinal (3. 

(iv) =>(v): It is obvious. 

(v) =>(i): Condition (v) implies that (3 ^ a for every (3 < a. Thus a is 

the smallest ordinal 7 such that a ss 7 . But this means, by the definition 
of cardinality, that a is a cardinal number, as then a = |a|. □ 

Property (v) of Proposition 5.1.1 explains why we often say that the 
cardinal numbers are the initial ordinals, that is, ordinals that are the 
smallest of a given cardinality. 

Let us also note the following theorem. 



5.1 Cardinal numbers and the continuum hypothesis 


63 


Theorem 5.1.2 Let A and B be arbitrary sets. The following conditions 
are equivalent: 

(i) \A\<\B\; 

(ii) there exists a one-to-one function g: A—^B. 

Moreover, if T ^ 0 then these conditions are equivalent to the condition 
that 


(iii) there exists a function f : B — > A from B onto A. 

Proof For A = 0 the equivalence of (i) and (ii) is obvious. So assume that 
A yf 0. 

(i)=>(iii): Let h: B — > \B\ and g: |H| — » A be bijections. Pick a € A. 
Notice that \A\ C \B\ and /i _1 (|H|) C B. 

Define / : B — > A by 


m 


g(h(b)) for b£h 1 (|A|), 
a otherwise. 


Then 


A D f[B\ D f[h~\\A \ )] = (g o h)[h~\\A\)} = g[\A\] = A. 

So / is onto A. 

(iii)=>(ii): Let f: B — > A be onto. Let G be a choice function for the 
family { f~ 1 (a): a £ A}. Then g(a) = G(/ _1 (a)) is a one-to-one function 
from A into B. 

(ii)=>(i): Let g: A — > B be a one-to-one function and h : B — > \B\ 
be a bijection. Then {h o g) : A — » \B\ is a one-to-one function from A 
onto (h o g)[A] C \B\. Hence |H| = |(/iog)[H]| and, by Corollary 4.2.6, 
Otp((/i o 5 )[H]) < \B\. So 

1^1 = \{h° g) [A ] | = | Otp((/i o g) [H])| < Otp((/io 5 )[H]) < \B\. □ 

Corollary 5.1.3 If A C B then |H| < \B\. 

Proof Since the identity function from A into B is one-to-one, the corol- 
lary follows immediately from Theorem 5.1.2. □ 

Corollary 5.1.4 If A C B C C and |H| = \C\ then \B\ = |H|. 

Proof By Corollary 5.1.3 we have |A| < \B\ < \C\ = |H|. So \B\ = |H|. □ 

Remark It is often the case that the relation A A B for a set A having 
cardinality less than or equal to the cardinality of B, |H| < |B|, is defined 
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as in Theorem 5.1.2(ii), that is, by saying that there exists a one-to-one 
function /: A—*B. Then it is necessary to prove that A + B and B < A 
imply that there exists a bijection from A onto B. This fact is known as the 
Schroder-Bernstein theorem and follows immediately from our definition 
and Theorem 5.1.2. 

Theorem 5.1.5 lo is a cardinal number and every n £ u> is a cardinal 
number. 

Proof First we show that every n £ u> is a cardinal number. So let n £ u>. 
By Proposition 5.1.1(iv) it is enough to prove that k\ < \n\ for every k < n. 
Thus it is enough to show that the inequality \n\ < |fc| is false, that is, by 
Theorem 5.1.2, that 

there is no one-to-one function /: n — > k for every k < n. (5-1) 

We will prove (5.1) by induction on n < w. 

Notice that for n = 0 condition (5.1) is true, since there is no k < n. 
So assume that it is true for some n < u>. We will show that this implies 
(5.1) for n + 1. 

To obtain a contradiction assume that for some k < n+1 there is a one- 
to-one function /: (n + 1) — > k. If k < n then f\ n : n — » k contradicts (5.1). 
So k = n, f: (n+ 1) — > n, and f[n+ 1] (jL (n — 1), that is, n — 1 £ f[n+ 1]. 
(Notice that n — 1 exists, since the existence of / : (n + 1) — > n implies that 
n ± 0.) 

Let i < n+1 be such that f(i) = n— 1. Define g : n — > n — 1 by putting 
g(j) = f(j) for j < n, j ^ i, and g(i) = f(n). (If t = n this last condition is 
redundant and can simply be ignored.) Notice that the values of g indeed 
belong to n — 1 and that g is one-to-one. Thus g contradicts (5.1) for n. 
This contradiction finishes the proof of (5.1) and the fact that every n £ u> 
is a cardinal number. 

To see that a; is a cardinal number notice that n C (n + 1) C co for every 
n <uj. Hence, by Corollary 5.1.3, |n| < |n+ 1| < |w|. But by what we have 
already proved, |n| = n < n+1 = |n+l|. So |n| < |n + l| < |w|, that is, we 
conclude that \n\ < |w| for every n < u. Therefore, by Proposition 5.1.1, 
we deduce that w is a cardinal number. □ 

The sets with cardinality less than w, that is, equal to some n < u>, are 
called finite sets. A set A is countable if |A| = w. A set is infinite if it is 
not finite. A set is uncountable if it is infinite and not countable. 

We have already established a convention that general infinite ordinal 
numbers are denoted by the greek letters a, (3, 7 , (, 7 , £. The infinite cardi- 
nal numbers will usually be denoted by the letters n, /i, A. 

The natural numbers and u> are cardinal numbers. Are there any other 
(i.e., uncountable) cardinal numbers? The next theorem gives us a tool to 
construct a lot of uncountable cardinal numbers. 
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Theorem 5.1.6 (Cantor’s theorem) |A| < [P(X)\ for every set X. 

Proof Fix a set X. First notice that |X| < \P(X)\, since the function 
f: X — > V{X) defined by /( x) = {x} for x £ X is one-to-one. 

To finish the proof it is enough to show that \X\ ^ \V{X)\. So let 
f: X — > V(X). We will prove that / is not a bijection by showing that / 
is not onto V{X). To see it, put 

Y = {x £ X: x ^ 

Then Y £ V(X) is not in the range of /, since if there were a z £ X such 
that f(z) = Y then we would have 

Z £ Y <=> z £ { x £ X: x ^ f(x)} <t=> 2: ^ /(z) O Z ^ Y, 

a contradiction. □ 

Cantor’s theorem tell us, in particular, that for any cardinal number n 
there is a cardinal number A larger than n, namely, A = \V{n)\. We will 
denote this cardinal number by 2 K . That way, the symbol 2 K will stand 
for two objects: a cardinal number and the family of all functions from n 
into 2. Although these two object are definitely different, this notation is 
consistent in the sense that 

\V{X)\ = |2 a '| for every set X. (5.2) 

This last equation is established by a bijection X : V(X) — > 2 A defined by 
X(A) = Xa, where Xa is the characteristic function of A , that is, Xa{x) = 1 
for x £ A and Xa(x) = 0 for x £ X \ A. 

The notation 2 K is also consistent with the cardinal exponentiation op- 
eration that will be introduced in the next section. 

Notice also that for every cardinal number k there is the smallest car- 
dinal number A greater than k, namely, a cardinal equal to the minimum 
of the set 

K = {/j, £ 2 2 : p is a cardinal and n < p}. 

The minimum exists, since K is a nonempty (2 K £ K) subset of a well- 
ordered set 2 2 . This unique cardinal number is denoted by n + and is 
called the cardinal successor of k. 

It is also worthwhile to note the following property of cardinal numbers. 

Proposition 5.1.7 If T is a family of cardinal numbers then a = is 
also a cardinal number. 

Proof By Theorem 4.2.3(iii) a is an ordinal number. Moreover, if /3 < a 
then (3 £ a and there exists a n £ T such that (3 £ n. So/3<k=|k|<|q:|. 
Thus, by Proposition 5. 1.1 (iii) , a is a cardinal number. □ 
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With the use of Proposition 5.1.7 and the operation of cardinal successor 
we can construct for every ordinal number a a cardinal number uj a by 
induction on £ < a in the following way: 


w 0 = w; 


W£+i = (w^) + for every £ < a; 

Wa = u wj for every limit A < a. 

«<A 

It is easy to see that ui a is the ath infinite cardinal number. It is also 
worthwhile to mention that very often u) a is denoted by K Q . (K is a Hebrew 
letter pronounced aleph.) 

Similarly, for every ordinal number a we can construct a cardinal num- 
ber D Q by induction on £ < a as follows: 

D 0 = w; 

D^+i = for every £ < a; 

Da = [J D^ for every limit A < a. 

(D is a Hebrew letter, which we read bet.) In particular, the cardinality of 
V(u>) is equal to Di = 2 W . This number is usually denoted by c, a Hebrew 
letter c, which we read continuum. 

Obviously, ujq = to = Do and, by Cantor’s theorem, Wi < D-| = c. 
It is also not difficult to prove that eo a < D a for every ordinal number 
a. Two natural questions that arise in this context are the following: Is 
u > i = c? Is (jj a = D q for every ordinal number a? Surprisingly, both these 
questions cannot be decided within ZFC set theory. The statement that 
wi = c is called the continuum hypothesis (usually abbreviated by CH) and 
is independent of ZFC set theory (see Section 1.1). Similarly, the statement 
“u a = D q for every ordinal number a ” is called the generalized continuum 
hypothesis (usually abbreviated by GCH) and is also independent of ZFC 
set theory. 

The independence of CH from the ZFC axioms will be proved in Chap- 
ter 9. Meanwhile, we will often use CH as an additional assumption to 
deduce other properties of interest. That way, we will know that the de- 
duced properties are consistent with ZFC set theory, that is, that we have 
no way to prove their negations using only ZFC axioms. 

The last theorem of this section gives some rules on comparing the 
cardinalities of sets. 
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Theorem 5.1.8 If \A\ = \B\ and \C\ = \D\ then 

(a) \A x C\ = \B x D\; 

(b) \A U C\ = \B U D\, provided AcC = = B C D; 

(c) \A c \ = \B d \. 

Proof Let f: A — > B and g : C — > D be bijections. Define the bijections 
for (a)-(c) as follows: 

(a) F:ixC-tSxD, F(a,c) = (f(a),g(c)). 

(b) F: A U C -► B U D, F = f U g. 

(c) F: A c — > B d , and F(h) £ B D for h £ A c is defined by the formula 
F(h)(d) = f(h(g~ 1 (d))) for every d £ D. 

It is left as an exercise to show that the functions F so defined are 
indeed bijections. □ 

We will finish this section with the following remark. The only cardinal 
numbers that were defined in a “natural” way (i.e., without an essential 
use of the axiom of choice) are the natural numbers and ui. The infinite 
ordinal numbers that we can easily construct, that is, construct with the 
use of ordinal number operations, are of the form w + 1, oo + 2, w + u>, 
coco, and so forth, and are not cardinal numbers. (For example, notice that 
\uu>\ < ui by defining the one-to-one function /: wu — > ui with the formula 
f(n, m) = 2 n 3 m , where we identify loco with the set loxlo from the definition 
of coco.) In fact, there is no way to prove without the axiom of choice the 
existence of uncountable ordinal numbers. This might seem strange, since 
the proof of Cantor’s theorem evidently did not use the axiom of choice. 
However, we in fact proved there only that |X| ^ \P{X)\. The proof that 
the set V{X) can indeed be well ordered requires the use of Zermelo’s 
theorem, which is equivalent to the axiom of choice. 

EXERCISES 

1 Let n £ co. Show that if a is an ordinal number such that \a\ = n then 
a = n. 

2 Prove that co a < D Q . for every ordinal number a. 

3 Complete the proof of Theorem 5.1.8 by showing that all functions F 
defined in this proof are bijections. 
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5.2 Cardinal arithmetic 

For cardinal numbers k and A define their cardinal sum by 
k ® A = | (ft x {0}) U (A x { 1}) | , 
their cardinal product by 


k 0 A = \k x A| , 

and a cardinal exponentiation operation by 



Notice that the exponentiation operation for n = 2 gives us another def- 
inition of 2 a . However, it is consistent with the previous definition, since 
|2 a | = 2\ Notice also that by the foregoing definitions and Theorem 5.1.8, 
for all sets A and B , 

\A U B\ = \A\ ® \B\ for A and B disjoint (5.3) 

and 

\AxB\ = \A\®\B\, \A b \ = \A\^. (5.4) 

Proposition 5.2.1 For all cardinal numbers n and A 

(i) k © A = A ® k, 

(ii) k 0 A = A <g> k. 

Proof From Theorem 5.1.8 we have 

/v®A = (kx {0})U(A x { 1}) | = |(A x {0}) U(kx { 1 }) | = A ffi k, 

where the second equation follows from the fact that \k x {0} | = \n x { 1 } | 
and |A x { 1} | = |A x {0} | . Also, 

k ® A = |k x A| = |A x k \ = A ® k, 

since /(© £) = (£, C) is a bijection between kxA and A x k. □ 

Thus, unlike the ordinal operations + and •, the cardinal operations © 
and ® are commutative. 
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Proposition 5.2.2 For every m,n € to 

(i) m(Bn = m + n<io, 

(ii) to <g) n = mn < lo. 

The proof is left as an exercise. 

Lemma 5.2.3 Every infinite cardinal number is a limit ordinal number. 
Proof First notice that 

|wU{x}| = M. 

If x £ lo then this is obvious. Otherwise this is justified by a function 
f : uj U {x} — y io defined by f{x) = 0 and f(n) = n + 1 for n < lo. Hence, 
by Theorem 5.1.8(ii), for every infinite ordinal number a 

|a + 1| = |aU{a}| = |(q\u)U(wU {cr}) | = |(a \ to) U u>\ = |a|. 

Therefore, by Proposition 5.1.1, an ordinal number a + 1 is not a cardinal 
number for any infinite a. □ 

Theorem 5.2.4 If k is an infinite cardinal then k (g> k = k. 

Proof Let n be an infinite cardinal number. We will prove by transfinite 
induction on u> < a < n that 


|a| ® |a| = |a|. (5.5) 

So let lo < A < k be such that (5.5) holds for every to < a < A. We will 
show that (5.5) holds for A as well. This will finish the proof. 

If there is an a < A such that o = |A| then 

|A| <g> |A| = \a\ ® |a| = |a| = |A|. 

Hence we may assume that A is a cardinal number. 

Now notice that for every a < A 

| a x a| < A. (5-6) 

It follows from Proposition 5.2.2(ii) for a finite a and from (5.5) for an 
infinite a. Define a well ordering ^ on A x A by putting (a, (3) -< ( 7 , 6) if 
and only if 

max{a, (3} < maxjy, <5} 


or 


max{a, (3} = maxjy, < 5 } and (a, (3) <\ ex 
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where <i ex stands for the lexicographic order onsxs (see Theorem 4.1.13). 
The proof that F is indeed a well ordering is left as an exercise. 

Now let ( 7 ,6) € A x A and let e = max {7 + 1,(5 + 1}. Then £ < A 
by Lemma 5.2.3. Moreover, the initial segment 0 ^(( 7 , 8}) (generated by 
( 7 , 6) with respect to 7 !) is a subset of e x e, since (a, (3) < ( 7 , <5) implies 
that ma x{a,@} < max{ 7 , (5} < e. Hence, by (5.6), 

|Otp(Ox((7,^)))l = |Ox«7^))l < \e x e\ = |er| < A 

and so also 

0tp(0-<((7,<5))) < A. 

(Otherwise, we would have A = |A| < | 0 tp( 0 ^(( 7 , <5)))|.) Since every 
ordinal number (3 € Otp(A x A, 7 !) is of the form 0 tp( 0 ^(( 7 , 5))) we can 
conclude that Otp(A x A, t<) < A. So 

| A x A| = | Otp(A x A, t<)| < Otp(A x A, 7 !) < A = |A|. 

Thus | A x A | = |A|, since the inequality |A| < |A x A| is obvious. □ 

Corollary 5.2.5 If A and n are infinite cardinals then 
«©A = K(g>A = max{K, A}. 

Proof We may assume that A < n. Then 

k © A = |(kx {0})U(Ax { 1}) I < |kx2| < |kx A| = k® A 


and 

k®A = |kx A| < \k x k\ = k. 

Since evidently k < k © A we obtain 

k<k©A<k®A<k. □ 

Corollary 5.2.6 A countable union of countable sets is countable. 

Proof Let IF be a countable family of countable sets, T = { F n : n < c a}. 
Then for every n < u> there is a bijection f n : u> — > F n . Define function 
f : u x to \JiF by putting /(n,?n) = /„(m). Notice that / is onto. So, 
by Theorem 5.1.2, | U T\ < \u> x w| = w. Hence | (J T\ = u>. □ 

Similarly, we can prove the following. 

Corollary 5.2.7 If k is an infinite cardinal and \X a \ < n for all a < k 
then | Ua<K X a | < n. 
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The proof is left as an exercise. 


Corollary 5.2.8 A finite union of finite sets is finite. 

The proof is similar to that of Corollary 5.2.6 (use Proposition 5.2.2). 
For a set A let A <w = (J n<a) Thus A <u is the set of all finite 
sequences with values in A. 


Corollary 5.2.9 If n is an infinite cardinal , then |k < w | = k. 

Proof It is easy to prove, by induction on n < lo, that K n = k for every 
0 < n < lo. So, by Corollary 5.2.7, 


K < 


Ki = 

li«" 


n<uj 


< K. 


□ 


Corollary 5.2.10 |Q| = |Z| = to. 

Proof Clearly lo = |N| < |Z| < |Q|. To see that |Q| < w it is enough to 
show that |Q| < |2 x lo x uo\, since |2 x to x lo\ = 2 (g) u> ® uo — uo. So let 
/:2xwxw-iQbe defined by f(i,m,n ) = (— It is easy to see 
that / is onto Q. □ 


Theorem 5.2.11 |R.| = |2“| = |P(cc)| = c. 

Proof \2 U \ = 2“ = c by the definition of c. |2“| = |P(w)| was proved in 
(5.2). The equation |R| = |2“| is proved by two inequalities. 

The first inequality \2 U \ < |R| is established by a one-to-one function 
/ : 2 U — > E defined by 


/((a 0 ,ai,...» = ^ 
n<.uj 

(The range of / is Cantor’s classical “ternary” set; see Section 6.2.) 

To see that |R| < |2 W | define a one-to-one function g: R — > V(Q) by 
g{r ) = {q £ Q: q < r} (i.e., g(r) is the Dedekind cut associated with r). 
So 

|R| < \V(Q)\ = |2 q | = 2 iqi = 2“. □ 

Theorem 5.2.12 If X and k are cardinal numbers such that A > to and 
2 < k < A then k x = 2 X . 

In part.icidar, X x = 2 X for every infinite cardinal number A. 
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Proof Notice that 

2 A c k x C A a C V{\ x A). 

Hence 

2 a < k x < A A < \V(X x A) | = |2 AxA | = 2 |AxA| = 2 a . □ 

Theorem 5.2.13 If n, A, and fi are cardinals, then 

k A0 ' 1 = k a (g) and (k a ) m = k A 0m . 

Proof It follows immediately from the following properties: 

A (BuC) » (H s ) x ( A c ) for B n C = 0, (5.7) 

and 

(A B f^A BxC . (5.8) 

Their proof is left as an exercise. □ 

For a set X and a cardinal number k define 
[X]^ K = {AG V{X) : \A\ < k}, [A] <k = {Ag V(X) : \A\ < k}, 

{Xy = {AeV(X): \A\ = k}. 

Proposition 5.2.14 For every infinite set X and nonzero cardinal k < |X| 

|[X] K | = |[A]^ K | = \X\ K . 

Proof Define ip: X K —> [X]- K by ip(f) = /[«]. Since ip is clearly onto, 
and [X} K C [A]- K , we have 

|[X] K | < |[A]^ K | < \X\ K . 

To finish the proof it is enough to show that \X\ K < |[A] K |. 

To prove it define ip: X K — > [k x X] k by ip(f) = f. Clearly ip is one-to- 
one. Hence 

|A| K = |X K | 

< iMxn 

= l[l« x AC|]-| (5.9) 

= nmn 

= im, (5.io) 

where equations (5.9) and (5.10) follow from the fact that |Y| = \Z\ implies 
|[Y] K | = |[Z] K |, which is left as an exercise. □ 

We will finish this section with the following important fact. 
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Theorem 5.2.15 The family C(R) = {/ € R R : / is continuous} has car- 
dinality continuum. 

Proof Define a function R : C(R) — > R® by R(f) = /|q. By the density of 
Q in R and the continuity of our functions it follows that R is one-to-one 
(see Section 3.3). Hence 

|C(R)| < |R®| = |R| iqi = \2 U \ U = (2 U Y = 2 W0W = 2A 

The other inequality is proved by a one-to-one function c: R — > C(R), where 
c(a) : R — » R is a constant function with value a. □ 


EXERCISES 

1 Use the definitions of cardinal arithmetic operations and Theorem 5.1.8 
to prove (5.3) and (5.4). 

2 Prove Proposition 5.2.2. Hint: Show by induction on n that m + n < w 
and mn < oj. Then use Exercise 1 from Section 5.1. 

3 Prove that the relation A defined in the proof of Theorem 5.2.4 is a 
well-ordering relation on A x A. 

4 Prove Corollary 5.2.7. 

5 Complete the proof of Theorem 5.2.13 by proving (5.7) and (5.8). 

6 Complete the proof of Proposition 5.2.14 by showing that \Y\ = \Z\ 
implies |[U] K | = \[Z} K \ for every sets Y and Z and any cardinal number n. 

7 A real number r is an algebraic number if there is a polynomial p(x) 
with rational coefficients such that p(r) = 0. Show that the set A C R 
of all algebraic numbers is countable. Conclude that the set R \ A of all 
nonalgebraic real numbers has cardinality continuum. 

8 A function /: R — > R is a Baire class-one function if there exists a 
sequence /„ : R — > R of continuous functions such that /„ converges to / 
pointwise, that is, such that liniyj^oo f n (x) = f(x) for every x € R. Find 
the cardinality of the family Bi of all Baire class-one functions. 
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5.3 Cofinality 

We say that a subset A of an ordinal number a is unbounded in a if A has 
no strict upper bound in a, that is, when there is no 7 £ a with £ < 7 
for all £ £ A. It is also easy to see that A C a is unbounded in a when 
a = UU + 1 : £ € A} or when 

V7 < a € A (7 < £). 

For an ordinal number a let cf(a) be the smallest ordinal number (3 
such that there exists a function /: f3 — > a with the property that /[/?] is 
unbounded in a. Such a function / is called a cofinal map (in a). The 
number cf(a) is called the cofinality of a. Thus /: (3 — > a is cofinal in a if 

G a G (3 < fin)) 


and 


cf(a) = m in {d : there is a cofinal map /: (3 — > a}. 


Notice that 


cf(a) < a for every ordinal number a, 

since the identity map is a cofinal map. Also, 

cf(a + 1) = 1 for every ordinal number a, 

since / : 1 — > a + 1, /( 0) = a, is cofinal in a + 1. It is also easy to see that 
if cf(a) = 1 then a has a greatest element. Thus cf(a) = 1 if and only if 
cf(a) < u if and only if a is a successor ordinal. 

We will be interested mainly in cofinalities of limit ordinal numbers. 

Lemma 5.3.1 For every ordinal number a there is a strictly increasing 
cofinal map f: cf(a) — > a. 

Proof For a a successor ordinal the map just shown works. 

If a is a limit ordinal and g: cf(a) — » a is a cofinal map, define by 
transfinite induction on 7 < cf(a) 

fW) = m ax J g(rj), (J (/(£) + 1) 
l £<v 



Clearly /(£) < f(() + 1 < /(?y) for every £ < ij < cf(a). Thus / is strictly 
increasing. 
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The fact that 


f(rf) £ a for every 77 < cf(a) (5-11) 

is proved by induction on ? 7 < cf(a). If for some 77 < cf(a) condition (5.11) 
holds for all £ < 77 then f(rj) = + 1)} < a. But f{rf) 

can’t be equal to a since otherwise the restriction of / to 77 would be a 
cofinal map in a and this would contradict the minimality of cf(o). Thus 
77 < cf(a). Condition (5.11) has been proved. 

Finally, / is cofinal in a since for every £ £ a there is an 77 < cf(a) with 

£ < ff(v) < f(v)- □ 

Corollary 5.3.2 cf(cf(a)) = cf(a) for every ordinal number a. 

Proof Let /: cf(cf(a)) — > cf(a) and g: cf(a) — > a be strictly increasing 
cofinal maps. Then go/: cf(cf(cc)) — » a is also cofinal, since for every 
£ < a there exist 77 £ cf(a) with £ < g{rj) and / £ cf(cf(a)) with 77 < /(/), 
so 

£ < g{i 1 ) < g{f{ 0 ) = (g° /)( 0 - 

Thus, by the minimality of cf(a), we have cf(cf(a)) > cf(a). So cf(cf(a)) = 
cf(a). ' ' □ 

An ordinal number a is regular if it is a limit ordinal and cf(a) = a. 

Theorem 5.3.3 If a is a regular ordinal number then a is a cardinal num- 
ber. 

Proof Put (3 = | cf(a)|, and choose a bijection /:/?—> cf(a) and a cofinal 
function g: cf(a) —7 a. Then (go/) : /? — > a is cofinal in a, since (go/) [/5] = 
g[/[/?]] = g[cf(a)]. Hence, by minimality of cf(a), cf(a) < /3 = |cf(a)|. □ 

Proposition 5.3.4 u> is regular. 

Proof It follows easily from Corollary 5.2.8. □ 

Theorem 5.3.5 n + is regular for every infinite cardinal number k. 

Proof Let a < n + and /: a — » k + . Then |a| < k and \f(£)\ < n for every 
£ < a. Hence llW /(0 + 1 ] | < k since a union of < n sets of cardinality 
< k has cardinality < k (see Corollary 5.2.7). Thus U$< a [/(0 + 1] 7 ^ K+ ! 
that is, cf(« + ) a for every a < n + . So cf(« + ) = n + . □ 

One of the most useful properties of regular cardinals is given in the 
following proposition. 
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Proposition 5.3.6 If X is a regidar cardinal, A C A, and |A| < A then 
there is an a < A such that A C a. 

Proof Let (3 = Otp(A) and let /: (3 — > A C A be an order isomorphism. 
Notice that (3 < cf(A), since |/3| = |A| < A = cf(A). So the set A = /[/?] 
cannot be unbounded in A. This implies the existence of an a < A such 
that £ < a for all £ £ A. Thus A c a. □ 

Another useful fact is the following. 

Proposition 5.3.7 If a < u>\ is a limit ordinal, then cf(cc) = ui. 

The proof is left as an exercise. 

We will finish this section with the following cardinal inequality. 
Theorem 5.3.8 For every infinite cardinal ft 

ft cf(K) > K. 

Proof Let /: cf (ft) — > ft be a cofinal map and let G: k — » ft cf ( K ). We will 
show that G is not onto K cf ( K ). 

Define h: cf(ft) — > ft by 

h(a) = min (ft \ {G(^)(a) : £ < /(a)}) 

for a < cf(ft). Such a definition makes sense, since |{G(£)(a) : £ < f(a)}\ < 

| f(a) | < ft. But for every £ < ft there exists an a < cf (ft) such that 
£ < /(a), that is, such that h(a) yf G(£)(a). Hence h yf G(£) for every 
£ < ft and h fL G[ft]. □ 

Corollary 5.3.9 cf(c) > u>. 

Proof Assume, to obtain a contradiction, that cf(c) = w. Then c cf ^ c ^ = 
(2“)“ = = c, contradicting Theorem 5.3.8. □ 

EXERCISE 


1 Prove Proposition 5.3.7. 
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Chapter 6 


Subsets of R n 


6.1 Strange subsets of MJ 1 and the diagonal- 
ization argument 

In this section we will illustrate some typical constructions by transfinite 
induction. We will do so by constructing recursively some subsets of 1" 
with strange geometric properties. The choice of geometric descriptions of 
these sets is not completely arbitrary here - we still have not developed 
the basic facts concerning “nice” subsets of R" that are necessary for most 
of our applications. This will be done in the remaining sections of this 
chapter. 

To state the next theorem, we will need the following notation. For a 
subset A of the plane R 2 the horizontal section of A generated by y £ R 
(or, more precisely, its projection onto the first coordinate) will be denoted 
by A v and defined as A y = {x £ R: (x,y) £ A}. The vertical section of A 
generated by x € R is defined byi I = {yGR: (x, y) £ A}. 

We will start with the following example. 

Theorem 6.1.1 There exists a subset A of the plane with every horizontal 
section A v being dense in R and with every vertical section A x having 
precisely one element. 

Proof We will define the desired set by induction. To do so, we will first 
reduce our problem to the form that is most appropriate for a recursive 
construction. 

The requirement that every horizontal section of A is dense in R tells 
us that A is “reasonably big.” More precisely, the condition says that 
A y fl (a, b) 0 for every a < b and y £ R. Therefore if we put T = 
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{(a, b) x {y} : a, b, y G R & a < 6} then we can restate it as 

for every J G T . (6.1) 

The requirement that every vertical section of A has precisely one point 
tells us that A is “reasonably small.” It can be restated as 

| A fl [{x} x R] | = 1 for every xtl. (6.2) 

The idea of constructing A is to “grow it bit by bit” to obtain at the 
end a set satisfying (6.1), while keeping approximations “reasonably small.” 
More precisely, by transfinite induction on £ < c, we will define a sequence 
{(x£, y %) : £ < c) such that A 0 = {(x^, : £ < c} satisfies (6.1) and has the 

property that | A 0 fl [{x} x R]| < 1 for every xel. It is enough, since then 
the set A = A 0 U {(x, 0) : A 0 n [{x} x R] = 0} satisfies the theorem. 

To describe our construction of the sequence, notice first that \T\ = c, 
since c = |R| < | T\ < |K 3 | = c. Let { J \ \ £ < c} be an enumeration of 
the family T. If for some £ < c the sequence ((x^,y^): £ < £} is already 
defined, choose 


(x 62 /{) £ \ 


x R 
C <€ 


(6.3) 


The choice can be made, since for every ( < ewe have \J^\ = R = c > 
|£| > Jj fl Uc <4 x R • S°, by the recursion theorem, the sequence 
((x{,%): ^ < c) exists. (More precisely, we define (x^,y^) = f(£) = h(f |j) 
where the oracle function h is defined implicitly by formula (6.3). Explicitly, 
h(((x^,y^}: C < 0) is defined as C(Jj \ {(x^,y^): ( < £}), where C is a 
choice function from the family 'P(R 2 ) \ {0}.) 

It is clear that A 0 = {(x^,y^): £ < c} satisfies (6.1), since for every 
J £ J 7 there exists a £ < c such that J = Jc and (x^, y^) G A 0 n = A 0 n J. 
It is also easy to see that the choice as in (6.3) implies that every vertical 
section of Aq has at most one point. □ 


The proof of Theorem 6.1.1 is a typical example of a diagonalization 
technique. In order to prove the theorem we had to find for every J G T 
a point ( xj,yj ) G J that would belong to a future set A. The choice was 
supposed to be done while preserving at every step the property that no 
two points chosen so far belong to the same vertical line. In order to prove 
the theorem we listed all elements of T and made our construction, taking 
care of each of its elements one at a time, using the fact that we still had 
“enough room” to make our choice. This technique of making inductive 
constructions by listing all important objects for the desired property and 
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then taking care of each of them one at a time is called the diagonalization 
technique. It is typical for such constructions that the inductive step is 
possible because of some cardinal argument similar to the one given earlier. 
The preservation part, like that mentioned before, is usually the hard part 
of the argument. 

The next theorem, due to Mazurkiewicz (1914), is very similar in char- 
acter to Theorem 6.1.1. 

Theorem 6.1.2 There exists a subset A of the plane R 2 that intersects 
every straight line in exactly two points. 

Proof Let {L^: £ < c} be an enumeration of all straight lines in the 
plane R 2 . By transfmite induction on £ < c we will construct a sequence 
{Aj : £ < c} of subsets of R 2 such that for every £ < c 

(I) has at most two points; 

(P) [Jc<£ does not have three collinear points; 

(D) [Jc<£ contains precisely two points of L £. 

Then the set A = U^< c will have the desired property. This is the 
case since the preservation condition (P) implies that every line contains 
at most two points of A, while the diagonal condition (D) makes sure that 
every line L contains at least two points from A. Thus, it is enough to show 
that we can choose a set A^ satisfying (I), (P), and (D) for every £ < c. 
This will be proved by induction on £ < c. 

So assume that for some £ < c the sequence £ < £} is already 
constructed. By condition (I) the set B = Uc<£ h as cardinality < c, 
being a union of |£| < c many finite sets. Similarly, the family Q of all lines 
containing two points from B has cardinality < \B 2 \ < c. Notice that by 
the inductive assumption (P) the set B ft Lj has at most two points. If 
it has precisely two points put A^ = 0 and notice that (P) and (D) are 
satisfied. If B D contains less than two points then L ^ intersects every 
line from Q in at most one point. Thus Ci\JQ = U Lgg fl L has at 
most \Q\ < c many points. 

Choose ^ C Tj \ (J Q to have two elements if B fl = 0 and to have 
one element if B fl has one point. It is easy to see that conditions (P) 
and (D) are satisfied with either choice of The construction has now 
been completed. □ 

The next theorem is yet another example of a geometric problem solved 
with help of the recursion theorem. In this theorem a circle in R 3 is un- 
derstood to be a set of points forming any nontrivial circle in any plane 
in R 3 . 
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Theorem 6.1.3 R 3 is a union of disjoint circles. 

Proof We have to construct a family C of subsets of R 3 such that (1) each 
C £ C is a circle, (2) the family C covers R 3 , and (3) different elements of 
C are disjoint. We will construct the family C as {Cj: £ < c} by induction 
on £ < c. For this, the natural approach is the following. Choose an 
enumeration {pj : £ < c} of R 3 , and for each p j choose a circle C j such that 
£ C%. This certainly would take care of (2); however, then we could not 
keep circles disjoint: For p^ £ Co \ {po} the circles Co and C % would not 
be disjoint. Thus we have to settle for a slightly weaker condition: We will 
choose C % such that 

(D) P{£lJc<{C<;- 

This is the “diagonal” condition that will imply that C covers R 3 . However, 
we have to make the choice in such a way that circles in C are pairwise 
disjoint. We will do this by assuming that for each £ < c the circles 
constructed so far are pairwise disjoint and we will choose C ^ preserving 
this property. More precisely, we will choose Cj such that the following 
“preservation” condition is satisfied: 

(P) QnU« ? C f = 0. 

Evidently, if we could construct C = {C^: £ < c} such that conditions (D) 
and (P) are satisfied for every £ < c, then C would be the desired family of 
circles. 

On the other hand, the recursion theorem tells us that we can find such 
a sequence (Cf : £ < c) as long as the family of all circles Cj satisfying (D) 
and (P) is nonempty for every £ < c. (Since then the choice function will 
work as an oracle function.) 

So assume that for some £ < c the sequence {C^: £ < £} is already 
constructed satisfying (D) and (P). We will prove that there exists a circle 

satisfying (D) and (P). 

If Pi ^ Uc<{ C c define p = pj. Otherwise, choose an arbitrary point 
p £ R 3 \Uf<j Cf- This can b e done since for every straight line L in R 3 the 
set Ln U c <c C C = U c<? (-bnO c ) has cardinality less than continuum, being 
a union of less than continuum many finite sets Lfl C<p We will choose C { 
containing p and satisfying (P). To do this, take a plane P containing p that 
does not contain any of the circles Cq for £ < £. This can be done since 
there are continuum many planes passing through p and there are only 
< |£| < c many planes containing circles from {C^: £ < £}. Now notice 
that the plane P intersects each circle C \ in at most two points. Thus the 
set S = P fl Uc<{ C C I ias cardinality less than continuum. Fix a line L in 
P containing p and let Cq be the family of all circles in P containing p and 
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tangent to L. Notice that different circles from Co intersect only at the 
point p. Thus there is a circle Cj G Co disjoint from S. This finishes the 
inductive construction and the proof. □ 

If in the previous three theorems the reader got the impression that we 
can construct a subset or a partition of R" with almost arbitrary paradox- 
ical geometric properties, the next theorem will prove that this impression 
is wrong. 

Theorem 6.1.4 The plane R 2 is not a union of disjoint circles. 

Proof Assume, to obtain a contradiction, that there is a family T of 
disjoint circles in R 2 such that R 2 = (J T. Construct, by induction on 
n < u>, a sequence { C n : n < u>} oi circles from T in the following way: 
Start with an arbitrary circle Cq G T and in step n + 1 choose a circle 
C n+ 1 G T that contains the center c n of circle C n . 

Notice that if r n is the radius of C n then |c n +i — c n \ = r n+ 1 < r n /2. 
Thus (c n : n < io) is a Cauchy sequence. Let p = limn^oo c n . Then p 
belongs to the closed disk D n bounded by C n for every n <uj. So p cannot 
belong to any circle C n since it belongs to D n+ 1 , which is disjoint from C n . 

Now let C G T be such that p € C. Then C ^ C n for every n <ui. But 
if n < to is such that r n is smaller than the radius of C then C fl C n ^ 0. 
This contradicts the choice of T . □ 

The next theorem shows that a simple diagonalization may lead to 
messy multiple-case considerations. In the theorem R 2 will be considered 
as a vector space. In particular, A + r will stand for the algebraic sum of 
a set A C R 2 and r G R 2 , that is, A + r = {a + r : a G A }. 

Theorem 6.1.5 There is a subset A of R 2 that intersects every one of its 
translations in a singleton, that is, such that the set (A+r)nA has precisely 
one element for every r G R 2 , r^O. 

Proof Enumerate R 2 \ {0} as {rj: £ < c}. We will define, by induction 
on £ < c, the sequences (a$ G R 2 : £ < c), {b$ G R 2 : £ < c), and (A^ : £ < c) 
such that for every £ < c 

(I) At = {a c : (<f}U{b c : ( < f}; 

(D) bt~at = r £ ; 

(P) if a,b,a',b' £ A^ are such that b — a = b' — a' ^ 0 then a = a' and 
b = b’. 

Before we describe the construction notice that A = Ut<c A; ^ ias the de- 
sired properties: (A + r) n A has at least one element by the “diagonal” 
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condition (D) for r = rj, since b j £ (H + rj) ft A; it has at most one element 
by the “preservation” condition (P), where we take r = b — a = b' — a' . 

To make an inductive step assume that for some £ < c the sequences 
are already constructed. The difficulty will be to find points a j and b j 
satisfying (D) while preserving (P). 

Let B = U c <| Ay Notice that B satisfies (P). 

If B fl {B + rj) ^ 0 and b G B fl {B + rj) then a = b — rj € B and we 
can define a^ = a and be = b. Condition (D) is satisfied by the choice of a 
and b , and condition (P) holds, since = B. 

So assume that B fl (B + r^) = 0 and notice that if we find a^ = x then 
be = d£ + = x + rj would be uniquely defined. Thus, it is enough to find 

x guaranteeing satisfaction of (P). In order to find it we have to avoid the 
situation 


b — a = b' — a' 0 and b b' , (6-4) 

where a , b, a', b' £ A^ = B U {x,x + r^}. So take x, a, b , a ' , b' that satisfy 
(6.4). 

There are several cases to consider. First notice that by our inductive 
hypothesis {a, b, a', b’} H {x, x + r^} y^ 0. 

In the first group of cases we assume that all elements a,b,a',b' are 
different. Then there are the following possibilities: 

• {a, 6, a' , b 1 } fl {x, x + r^} = {x}. Then x = c + d — e for some c,d,e € 
B. To avoid this situation it is enough to take x outside the set Si = 
{c+ d- e: c,d,e £ B}. 

• {a, 6, a', b 1 } fl {x, x + r^} = {x + r^}. Then x = c + g? — e — for some 

c,d,e £ B. To avoid this situation it is enough to take x outside the 
set S 2 = {c + d — e — : c,d,e £ B}. 

• {a, 6, a', b 1 } fl {x, x + r^} = {x, x + re}- 

If {a, 6} = {x, x + ?’j} then ±rj = b' — a' . However, this would contradict 
the fact that B fl {B + r^) = 0. Similarly {x,x + r{\ cannot be equal 
to {o', 6'}, {a, a'}, or {b, b 1 }. 

If {x, x + = {a, b 1 } or {x, x + r^} = {a', 6} then 2 x = c + d — for 

some c,d £ B. To avoid this situation it is enough to take x outside the 
set S 3 = {(c + d — r^)/2: c,d£ B}. 

The second group of cases consists of the situations in which some of 
the elements a, b , a', b' are equal to each other. It is easy to see that there 
are only two possibilities for such equations: b = a 1 and a = b' . Moreover, 
these equations cannot happen at the same time. Thus assume that a! = b 
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and that the other elements are different (the case when a = b' is identical) . 
Then 2 b = a + b'. 

All the cases when b ^ {x,x + r{\ are identical to those previously 
considered. So assume that b £ {x,x + r^}. Then there are the following 
possibilities: 

• b = x. Then 2x = a + b' . 

If x + r^ ^ {a, b'} then 2x = c + d for some c,d £ B. To avoid this 
case it is enough to take x outside the set S 4 = {(c + d)/2: c,d £ B}. 

If x + € {a, b'} then 2x = c + x + r^ for some c £ B. To avoid this 

situation it is enough to take x outside the set S 5 = {c + rj : c £ B}. 

• b = x + r%. Then 2x = a + b' — 2 r$. 

If x ^ {a, b'} then 2x = c + d — 2 r^ for some points c,d £ B. To 
avoid this particular situation it is enough to choose x outside the set 
Sq = {(c + d — 2r^)/2: c,d£ B}. 

If x £ {a, b'} then 2 x = c + x — 2 r^ for some c £ B. To avoid this case 
it is enough to take x outside the set S 7 = {c — 2 : c£ B}. 

It is easy to see that each of the sets S, for i = 1, . . 7 has cardinality 
< |£| 3 < c. Thus we can take x from the set R 2 \ (jJ =1 <?»• □ 

In all of the previous examples of this section we used preservation 
conditions of finite character such as noncollinearity, finite intersections 
of different circles, and so forth. This made obvious the fact that these 
conditions were preserved when we took a union of previously constructed 
sets. This approach, however, does not always work. For example, the next 
theorem will state that R 2 is a union of countably many sets Si such that 
no set Si contains two different points of rational distance apart. If we try 
to prove this theorem starting with an arbitrary well ordering { p ^ : £ < c} 
of R 2 and at step £ try to add the point p $ to some set Si, then already at 
step to we might find ourselves in trouble - it might happen that for every 
i < to there is already an element p n £ Si, n < to, of rational distance from 
p u . If so, we could not assign p u to any 5, . To solve these difficulties we 
will proceed in a different manner, to be described subsequently. 

We will start with the following easy but very useful lemma that will 
also be used very often in the rest of this text. To state it, we need the 
following definitions. Let F: X k -4 [A']- w for some k < to. We say that 
a subset Y of X is closed under the action of F if F(yi, . . yjf) C Y for 
every yi, . . ., y^ £ Y. If /: X k — > X then Y C X is closed under the action 
of f if it is closed under the action of F, where F(x) = {/(x)}. Finally, if 
T is a family of functions from finite powers of X into either [A]- w or X 
then Y C X is closed under the action of T if it is closed under the action 
of F for every F £ T . 
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Lemma 6.1.6 Let T be an at most countable family of functions from 
finite powers of X into [X]- w or X. Then 

(a) for every Z C X there exists a smallest subset Y of X closed under 
the action of T and containing Z; this set, denoted by cl p{Z), has 
cardinality less than or equal to \Z\ + u>; in particular, \ c\j:{Z)\ = \Z\ 
for every infinite Z; 

(b) if |X| = k > u then there exists an increasing sequence (X a : a < k) 
of subsets of X closed under the action of T such that X = 

\X a \ < k for all a < k, and X\ = {J a< \X a for every limit ordinal 
A < K. 

Proof In the proof we will identify functions / : X — > X from T with 
F: X -+ [X]\ F(x) = {/(*)}. 

(a) Let p = \Z\ + lv. Construct, by induction on n < ui, an increasing 
sequence (Z n G [AT] M : n < u) by putting Zq = Z and defining Z n+ \ as 

Z n \j\J{F(z u ... ,Zm): F£Fk AoitvF = X m & ( Zl ,...,z m ) G Z™}. 

Notice that \Z n+ i\ < p. This is the case since \Z n \ < p and the second 
set is a union of a family of sets of cardinality < to indexed by a set of 
cardinality < \F\ (g) \Z n \ <u < p. 

Define cl f(Z) = U n<u , Z. n . Obviously Z C cl^-(Z) C X and | cl^-(Z)| < 
p , as c1jf(Z) is a union of countably many sets of cardinality at most p. 

To see that cl^-(Z) is closed under the action of T take F G F . If X m 
is the domain of F and Z\, . . . ,z m G cl^-(Z) then there is an n < ui such 
that zi,... , z m G Z n , and so F(z i, . . . , z m ) C Z n+1 C cljr(Z). 

In order to prove that cl p(Z) is the smallest subset of X closed under the 
action of F and containing Z let Y be another such set. Then Zq = Z C Y 
and by an easy induction we can prove that Z n C Y for every n < u>. So 

cl^-(Z) = U ra<w Z n C Y. 

(b) Enumerate X as {a;^: f < n} and define X a = cbr({:r£: f < a}). 

Then the equation X = [^ a<Kj X a is obvious and \X a \ < \a\+u> < n for 
a < k follows immediately from (a). The two other conditions are simple 
consequences of the fact that cl^-(Z) is the smallest subset of X containing 
Z and being closed under the action of T . □ 

The next theorem is due to Erdos and Hajnal (Erdos 1969). 

Theorem 6.1.7 There is a countable partition {S): i < lo] of R 2 such 
that the distance between any two different points of the same set Si is 
irrational. 
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Proof By induction on k = X we are going to prove that every set 
Icl 2 can be decomposed as described in the theorem. 

So let I C K 2 be such that |X| = k and suppose that the foregoing 
statement is true for every Y C K 2 of cardinality < k. We have to prove 
the statement for X. 

If k < w then the statement is obvious, since we can put every element 
of X in a different set S z . So assume n > to. 

For every p,q £ Q define F pq : (R 2 ) 2 — > [R 2 ] <UJ by 

F pq (x,y) = {z £ R 2 : \x- z\=p & \y-z\ = q}. 

Notice that indeed F pq (x,y) € [R 2 ] <aJ since it has at most two points. Let 
T = { F pq : p, q £ Q} and let X = U a<s ^a, where (X a : a < k) is a 
sequence of subsets closed under the action of T as in Lemma 6.1.6(b). 

We will define a decomposition of X into sets S) by defining g : X — » u> 
and Si = < 7 _1 (i). (Thus the function g tells us that an element x £ X is 
put into S g(x) .) The function g must have the property that 

if g(x) = g(y) for different x,y £ X then \x — y\ ^ Q. (6.5) 

We will define the function g inductively on the sets X a for a < n. 

Assume that for some (3 < n the function g is already defined on each 
X a for all a < (3. If (3 is a limit ordinal, then g is already defined on 
Xp = Ua </3 X a and it is easy to see that it satisfies property (6.5). Assume 
that (3 is a successor ordinal, say (3 = a + 1. Then g is defined on X a . We 
have to extend our definition to the set Z = X a+ i \ X a . 

Since \Z\ < |X Q+ i| < k we can find a function h: Z — > u> satisfying 
(6.5). We might try to define g on Z as h. However, this might not work, 
since for z £ Z there might be an x £ X a such that \z — x\ £ Q. But 
X a is closed under the action of T . Thus for every z £ Z there is at most 
one x z £ X a such that \z — x z \ £ Q. So we will define g on Z such that 
for every z £ Z and h(z) = n we have g(z) £ {2 n, 2 n + 1}. Then g\z will 
satisfy (6.5), as h did. Now it is enough to choose g(z) different from g(x z ), 
if x z exists, and arbitrarily otherwise. □ 

It can be proved also that R" can be decomposed as in Theorem 6.1.7 
for all «eN. However, the proof for n > 2 is considerably more difficult. 

The last theorem of this section is due to Sierpinski (1919). Although 
its proof is essentially easier that those of previous theorems it has its 
own flavor - the set constructed in the theorem exists if and only if the 
continuum hypothesis is assumed. 
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Theorem 6.1.8 The continuum hypothesis is equivalent to the existence 
of a subset A of R 2 with the property that 

\A v \<u> and | (R 2 \ A)J < ui 


for every x,y £ R. 

Proof =>: First assume the continuum hypothesis c = u>i and let A be a 
well ordering of R of type uj\. (To find it, take a bijection /: R — > u>\ and 
define x Ay if and only if f(x) < f(y)-) Define A = {(x, y) : x A y}. 

Now A v = {x £ R: (x,y) £ A} = {x £ R: x A y} is an initial segment 
of a set with order type w i, so it has cardinality < u> i, that is, < to. 
Similarly, 

(R 2 \ A) x = {{(x,y) £ R 2 : (x,y) fL A}) x 

= ({(x,y) £ R 2 : y A x}) x 

= {y£R:y^x} 

is an initial segment of a set with order type u)\, that is, has cardinality 
< ui. 

<=: Assume that |R| = c > u>\ and let A C R 2 be such that \A V \ < u> for 
every y £ R. We will show that the complement of A has an uncountable 
vertical section. 

Let Y C R be such that |T| = (if /: c — » R is a bijection, take 
Y = f[u> i]). Let X = U yGY A v ■ Then \X\ < as it is a union of u>\ sets 
of cardinality < u>\. Take x £ R \ X. Then (x,y) fL A for every y £ Y, 
since x ^ A v . Hence {:c} x Y C R 2 \ A and so Y C (R 2 \ A) x . Therefore 
| (R 2 \ A) x | > u>i > to. □ 


EXERCISES 

1 Complete the proof of Lemma 6.1.6(b) by showing that for every limit 
ordinal A < w\ the set X\ = (J Q ,< A X a is closed under the action of T as 
long as every X a is closed under the action of T for a < A. 

2 Prove that R 3 \ Q 3 is a union of disjoint lines. 

3 Generalize Theorem 6.1.3 by proving that R 3 is a union of disjoint circles 
of radius 1. 

4 Modify Mazurkiewicz’s theorem (Theorem 6.1.2) by proving that there 
exists a subset A of the plane R 2 that intersects every circle in exactly three 
points. 
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5 Modify Sierpiriski’s theorem (Theorem 6.1.8) by proving that the contin- 
uum hypothesis is equivalent to the following statement: There exist sets 
Aq,Ai,A2 C R 3 such that R 3 = A$ U A\ U Ai and 

\{{x 0 ,X ll X2) G Ail Xi = x}\ < u> 


for every i < 3 and itR. 

Try to generalize this and Theorem 6.1.8 to R™ for all 1 < n < u>. Prove 
your claim. 

6 (Challenging) Prove the following theorem of Ceder: There is a decom- 
position {S'i}j< w of R 2 with no .5) spanning an equilateral triangle. Remark: 
It is also possible to find a decomposition {Si}i <UJ of R 2 with no .5) spanning 
an isosceles triangle. This has been proved recently by Schmerl. 


6.2 Closed sets and Borel sets 

In what follows we will concentrate on the topological structure of R". 
Recall that Q" is a countable dense subset of R" and that the family 
B = {B(p,e): p G Q n & £ G Q} of open balls forms a countable base for 
R™, that is, for every open set U in R" and every p G U there is a B £ B 
such that p G B C U. 

A point p of a subset P of R" is an isolated point of P if there is an 
open set U (in R" or in P) such that U D P = {p}. A subset S of R" is 
discrete if every point of S is isolated in S. A nonempty closed subset F 
of R” is said to be perfect if it has no isolated points. 

We will start this section with a study of the structure of closed subsets 
of M". For this we will need a few theorems that are of interest in their 
own right. 

Theorem 6.2.1 

(i) Every family U of pairwise- disjoint open subsets of R n is at most 
countable. 

(ii) Every discrete subset S of R" is at most countable. 

(iii) If a is an ordinal number and {S^: £ < a} is a strictly increasing 
(decreasing) sequence of open (closed) sets in R ra then a is at most 
countable. 


Proof (i) Define f :U\ {0} — > Q" by choosing f{U) G U ft Q™ for every 
U GW. Then / is one-to-one, and so \U\ < |Q"| + \ = w. 
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(ii) For every s € S let r s > 0 be such that S f! B(s,2r s ) = {s} and 
let U = { B(s,r s ): s € S}. Notice that B(s,r s ) n B(t,r t ) = 0 for distinct 
s,t € S. Thus, by (i), U is at most countable. Hence |S| < \U\ < ui. 

(iii) We assume that U = {S^ : £ < a} is a strictly increasing sequence 
of open sets. The argument for the other case is essentially the same. 

To obtain a contradiction assume that a is uncountable. Thus a > uq. 
For £ < uq choose x $ € S|+i \ S j and let B^ G B be such that x^ € B^ C 
«£+!■ Then the set X = {a;j: £ < u>i} is uncountable, since ^ x f for 
C < £ < uq. But the function f(x{) = is one-to-one, since 

for ( < £ < uq, x^ £ B^\B^, that is, B £ ^ B^. However, this is impossible, 
since it would imply uq = |A| < \B\ = u>. 

This contradiction finishes the proof. □ 

Theorem 6.2.2 For an ordinal number a the following conditions are 
equivalent: 

(i) a <u> i; 

(ii) there exists an S C R well ordered by the standard relation < of order 
type a. 

Proof (ii)=>(i) : Assume, to obtain a contradiction, that for some a > uq 
there exists an S C R with order type a. Let h: a — > S be an order iso- 
morphism and define C/j = + 1)) for £ < uq. Then {[/j : £ < uq} 

is an uncountable family of nonempty, open, and pairwise-disjoint subsets 
of R. This contradicts Theorem 6.2. l(i). 

(i)=t>(ii): For the other implication we will prove the following fact by 
induction on a < uq: 

(*) for every a, b € K. with a<b there exists an S C [a, b) with Otp(S’) = a. 

For a = 0 the condition is clearly true. So let 0 < a < uq be such that 
(*) holds for all (3 < a. We will show that (*) holds for a. 

Take a,b £ R with a < b. If a = (3 + 1 is an ordinal successor, pick c 
with a < c < b and S C [a, c) with Otp(S) = (3. This can be done by the 
inductive assumption. Then S'U{c} C [a,b) and Otp(5U{c}) = (3+1 = a. 

If a is a limit ordinal, then cf(a) < a < uq and cf(a) is infinite. So 
cf(a) = u>. Let {a n < a: n < uj} be a strictly increasing sequence that is 
cofinal in a with ao = 0. Find a strictly increasing sequence a = ao < a± < 
■ ■ ■ < a n < ■ ■ ■ < b (for example, put a n = b— [( b — a)/(n + 1)]). For every 
n < to choose S n C [a n , a n + 1 ) with Otp(S'„) = Otp(a n +i\a n ). This can be 
done by the inductive hypothesis, since Otp(a n+ i \ a n ) < a n+1 < a. Put 
S = Un<o? ^ n ‘ Tlien S C [a, b). Moreover, if /„: (a n+1 \a„) — > S„ is an 
order isomorphism then / = (J n<w f n : a — > S is also an order isomorphism, 
that is, Otp(S') = a. Condition (*) has been proved for every a < uq. 

Now (*) clearly implies (ii). □ 
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Theorem 6.2.3 Every perfect subset P o/R" has cardinality continuum. 

Proof Clearly |P| < |R"| = c. Thus it is enough to show that |P| > c. 
Before we prove it we notice the following easy fact. 

(A) If R c R" is perfect, U C R" is open, and RnU = R fl cl(Z7) ^ 0 
then R fl cl(17) is perfect. 

Clearly R n cl(f7) is closed. Also, p € R fl cl(P) cannot be isolated since 
otherwise there would exist an open set W C M n with the property that 
{p} = (R n cl(C/)) n W = {R n U) fl W = R n (U n W) and the set U n W 
would show that p is an isolated point of R. 

Now let Pi : R" — > R be the projection onto the itli coordinate. If 
there exists an i < n such that [P] | = c, then |P| > c. So, to obtain a 
contradiction, assume that 

(B) |pj[P]| < c for every i < n. 

Notice also that 

(C) if R C P is perfect, then there are disjoint perfect sets P 0 ,Pi C R. 

To see it, let i < n be such that Pi[R] contains two numbers a < b. By (B) 
we can find r £ (a, b) \p,;[P]. Then, by (A), the sets Rq = Pflp“ 1 ((— oo, r]) 
and Pi = P fl p“ 1 ([r, oo)) are disjoint and perfect. 

Now we construct a family {P s cP: s £ 2 <UJ } of perfect bounded sets 
by induction on the length |s| of a sequence s. 

We put P® = P fl [a, b] n , where a < b are chosen in such a way that 
P 0 ^ 0 and a,b £ R\Uj<„Pi[-P]- This can be done by (B). So PC\(a,b) n = 
Pfl [a, b] n . In particular, Pg is compact and, by (A), perfect. We will 
continue the induction, maintaining the following inductive condition to be 
satisfied for every s £ 2 <w . 

(I) P s o and P sl are disjoint perfect subsets of P s , 

where sO and si stand for the sequences of length |s| + 1 extending s by 0 
and 1, respectively. We can find such sets by (C). 

Now notice that for / € 2“ the sets Pf\ k for k < u> form a decreas- 
ing sequence of nonempty compact sets. Thus, the sets Pf = Plfc<w Pf\k 
are nonempty. Notice also that the sets Pf and P g are disjoint for dif- 
ferent f,g € 2". To see this, let k = min{i € w: f(i) ^ g{i)}- Then 
{f(k),g(k)} = {0,1} and 

Pf C Pg C Pf\ k + 1 n Pg\ k+1 = PsO Pi Psl = 0) 


where s = f\ k = g\k ■ 
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Let ip: 2“ — > P be such that ip(f) £ Pf (this is a choice function). Then 
ip is clearly one-to-one and so |P| > c. □ 

The induction constructions of the preceding proof are often referred 
to as tree constructions , since T = 2 <u is called a tree. (We imagine 
every s £ T as a “tree-branching point,” where the tree is splitting into sO 
and si.) 

For F C R" we denote by F' the set of all limit points of F, that is, all 
points of cl(.F) that are not isolated. Clearly F \ F' is equal to the set of 
all isolated points of F. 

Now we are ready to prove the following theorem characterizing closed 
subsets of R". 

Theorem 6.2.4 (Cantor-Bendixson theorem) Every uncountable, closed 
subset F of R" can be represented as a disjoint union of a perfect set P 
and an at most countable set C . 

Proof By induction on a < u>\ we construct a sequence { F a : a < u>i} of 
closed subsets of R n by defining: 

F 0 = F, F\ = P| F$ for A < uq, A a limit ordinal, 
f<A 

and 

F a+ 1 = {F a y. 

The sequence {F a : a < tq} is decreasing and formed of closed sets. It 
can’t be strictly decreasing by Theorem 6.2.1 (iii) . Thus there exists an a < 
u>i such that F a = F a+ 1 . This means that F a does not have any isolated 
points, that is, F a is either perfect or empty. But F\F a = {Jg <a {Fp\Fp + i) 
and every set Fp \ Fp + 1 = Fp \ (Fp) 1 is discrete, so, by Theorem 6.2.1(h), 
it is at most countable. Hence F \ F a is a union of |a| < u> sets, each of 
which is at most countable. Therefore C = F \ F a is at most countable, 
and P = F a = F\C cannot be empty. So P is perfect. □ 

Corollary 6.2.5 The cardinality of a closed subset of W 1 is either equal 
to continuum or is at most countable. 

Theorems 6.2.3 and 6.2.4 tell us the properties that closed subsets of 
R" must have. Can we tell anything more? Exercise 1 shows that we 
cannot expect much improvement in the case of Theorem 6.2.4. But what 
about Theorem 6.2.3? How can perfect sets look? They can certainly be 
formed as a union of some number of closed intervals. But for such sets 
Theorem 6.2.3 is obvious. Are there any other perfect sets? The affirmative 
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answer is given by a set C C R to be constructed and known as the Cantor 
set (or Cantor ternary set). It is constructed as follows. 

First, we define the family {I s C P: s £ 2 <u } of closed intervals by 
induction on the length |s| of s. We put Iq = [0,1]. If the interval I s = 
[a s , b s ] is already constructed, we remove from it the middle third interval, 
and define I s o and I s i as the left interval and the right interval, respectively, 
of the remaining two intervals, that is, I s o = [a s ,a s + (b s — a s )/3] and 
I si = [b s - ( bs - a s )/3, b s ]. Put C n = U{^: |s| = n}. Thus C n is the 
union of all 2" internals of length 1 /3" constructed at step n. Notice that 
[0, 1] = Co D Ci D • • ■ and that each C n is closed, being a finite union of 
closed intervals. We define C = 

The set C is clearly closed and nonempty. To see that it is perfect, let 
p £ C and let J be an open interval containing p. Then there is an s £ 2 <w 
such that the interval I s (of length l/3l s l) contains p and is contained in J. 
Let j < 2 be such that p £ I S j. Since CC\I s j = C\ n<UJ {C n d S j) is nonempty, 
being the intersection of a decreasing sequence of nonempty compact sets, 
we conclude that Cfl J ^ {p}. So C is perfect. 

Notice also that C does not contain any nonempty interval. This is so 
since for any interval J of length £ > 0 we can find an n < u> such that 
1/3” < e. But C n is a union of intervals of length 1/3” < e. So J (jL C n , 
and J <£_ C since C C C n . 

Finally, it is worthwhile to mention that C is the set of all numbers 
from [0, 1] whose ternary representations do not contain the digit 1, that 
is, 

This will be left without a proof. 

To define Borel subsets of topological spaces we need the following def- 
initions. A family A C V{X) is a cr-algebra on A if A is nonempty and 
closed under complements and under countable unions, that is, such that 

(i) 0, X £ A ; 

(ii) if A £ A then X \ A £ A; 

(iii) if Ak £ A for every k < co then Ufc<w e 

Notice also that every cr-algebra is also closed under countable intersections, 

f| A fc = A\ |J (X\A k ). 

k<.oj k<cu 


since 



94 


6 Subsets of 1 " 


Examples For every nonempty set X the following families form u-algebras: 

(1) A = {®,X}, 

(2) A = V(X), 

(3) A = {A £ V(X) : \A\< k or | X \ A\ < k} for any infinite cardinal k. 

Notice that for every T C V{X) there is a smallest cr-algebra cr [F\ on 
X containing F, namely, 

am = flf-4 C V{X ) : T C A & A is a a- algebra on X}. (6.6) 

In particular, for every topological space (X, r) we define the cr-algebra Bor 
of Borel sets by Bor = cr[r]. 

It is obvious from the definition that the following sets are Borel: 

• Gg sets, that is, the countable intersections of open sets; 

• F a sets, that is, the countable unions of closed sets; 

• Gs a sets, that is, the countable unions of Gg sets; 

• F a g sets, that is, the countable intersections of F a sets; 
and so on. 

Is an arbitrary subset of a topological space (X, r) Borel? The answer to 
this question depends on the topological space. For example, in a discrete 
space (X,V(X)) every set is open and hence Borel. This is, however, quite 
an exceptional example and more often V(X) ^ Bor. We will not approach 
this question in general. However, we will address it in the case of K™. 

To shed more light on the structure of Borel sets, define inductively the 
following hierarchy of sets for a topological space (X,t): 


= s U (*\ Ak ^ :AkG U for a11 k < 

0 </3<a 

Define also 

n"={I\A:deS“} for 0< 

Thus 

= < U Ak : A k G for all k <u>\ for 1 < a < uj\ . 

I k<u> 0</3<a I 


for 1 < a < u)\. 


a < u\. 
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It is very easy to prove, by induction on 0 < a < u>\, that 

E°,n°c£?or for every 0 < a < W\. (6.7) 

Notice that II ) 1 is equal to the family of all closed subsets of X, E° = F a , 
n° = Gs , E° = Gsa, and n° = F aS . 

From now on we will assume that X = R". In particular, any closed 
subset F of R n is Gg, as 

F= p|{*eR" : By eF(d(y,x)<l/(k +!))}= f) |J B(y,l/(k + l)). 

k<cj k<.u) y€F 

Similarly, every open set in R" is F a . Hence 

E? C E° 2 and n? C n^. ( 6 . 8 ) 

Using these inclusions it is not difficult to prove 
Proposition 6.2.6 For every 0 < (3 < a < u>i 

yO r yO ttO r ttO yO r ttO ttO r yO 

^0 C ‘-‘a > U /3 C ^0 U a> U /3 C ^ a • 

Proof Inclusion njg C E° follows immediately from the definition of E° . 
Now, to see Ejg C n° notice that 

A e => R n \ a e c e° => a e n° . 

We prove 

Ejg C E° , C n° for every 0 < (3 < a 

by joint induction on a. So assume that for some 1 < 7 < u\ this condition 
holds for every a < 7 and let 0 < 6 < 7 . We have to prove that E° C E° 
and nj C n). 

But then (J /3<6 C U /3<7 and so 
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Theorem 6.2.7 The family Bor of Borel subsets of R” is equal to 

B°r — U S ° = U n - 

0<a<u;i 0<a<u;i 

Proof First notice that Proposition 6.2.6 implies 

U s “= U n °„. 

0<a<u;i 0<o:<a;i 

Let T denote this set. Notice also that (6.7) implies that F C Bor. Since 
T contains the topology r of R™ it is enough to show that IF is a cr-algebra, 
since Bor = cr[r] is the smallest u-algebra containing r. Thus we have to 
prove that T is closed under complements and countable unions. 

If A € T then A € E° for some 0 < a < to\ . Hence R" \ A € n° C F. 
If Ak € F for k < u) then for every k < u there exists 0 < otk < uq such 
that Ak € n° fe . But tui = u + is regular, so there exists an a < uq such 
that Ofc < a for every k < u>. Hence 

U n° 

0</3<a 

for every k < w, and so U fe<w A k G E° C F . □ 

Theorem 6.2.8 The family Bor of all Borel subsets of R™ has cardinality 
continuum. 

Proof First we will prove by induction that 

|n°| = |E° | = c for every 0 < a < uq. (6.9) 

Equation |H° | = |E° | is established by a bijection /: E° — > n° defined by 
f(A) = X \ A. Next we will prove that |E°| = c. The inequality | E^ | = 
|r| < c is justified by a surjective function f: V(B) — > r, f(U) = (J U, 
where B is a countable base for R n . The inequality |r| > c follows from the 
facts that the base B contains an infinite subfamily Q of pairwise-disjoint 
sets and that / restricted to V(Q) is one-to-one, while \V(Q)\ = c. 

Now assume that for some 1 < a < u>\ we have |n^| = |E°| = c for all 
0 < ft < a. We have to prove |E° | = c. 

But, by the inductive hypothesis, I U/3< a ^^1 = c since it is a union 

of | a | < to i < c sets of cardinality c. Moreover, F: 0«x n i) ->■ 

defined by F((A 0 , A lt . . .)) = \J n<u Ai is onto E° , so 

) UJ UJ 

= U n d =(2")" = c. 

(3<a 
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Since c = |S°| < |S° |, condition (6.9) has been proved. 

Now, by Theorem 6.2.7, Bor = Uo<a<wi is a un i° n of uq < c sets of 
cardinality c, that is, it has cardinality continuum. □ 

Corollary 6.2.9 There is a non-Borel subset of K™. 

Proof By Theorem 6.2.8 we have 

|P(R n )| = 2' Rn l =2 C > c = | Bov | , 


that is, P(R n ) \ Bor ^ 0. □ 

We will construct some explicit examples of non-Borel sets in the next 
section. 


EXERCISES 

1 For a closed set F C R let { F a : a < «i} be a sequence from the proof 
of Theorem 6.2.4. Show that for every a < u>i there exists a closed subset 
F CR for which F a+ 1 ^ F a . 

2 Prove that the family <j[F\ defined by (6.6) is indeed a cr-algebra on X. 

3 Show that Theorem 6.2.7 remains true for an arbitrary topological space. 

4 Prove the following generalization of Lemma 6.1.6. 

Let X be a set and let F be a family of functions of the form / : X a — » 
X, where a < u>. If \F\ < c then for every Z C X there exists a Y C X 
such that 

(i) Z cY- 

(ii) Y is closed under the action of F, that is, for every / : X a — » X from 
F and S € Y a we have f(S) € Y : 

(hi) |y| < \z\ w + c. 

Notice that this fact (used with X = Z = r, and F composed 

of countable unions and complements) implies Theorem 6.2.8. 


5 Prove that every uncountable Borel subset of R contains a perfect subset. 
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6.3 Lebesgue-measurable sets and sets with 
the Baire property 

To construct the most useful cr-algebras we need the following definitions. 

Recall, from Section 2.1, that a nonempty family X of subsets of a set 
A is an ideal on X if it is closed under the subset operation and under 
finite unions, that is, such that 

(i) if A, B £ X then A U B gl; 

(ii) if A € X and B C A then B £ X. 

An ideal X is said to be a cr-ideal if it is closed under countable unions, 
that is, if 

(i') Ufceo; Ak G X provided Ak £ X for all k < to. 

The elements of an ideal on X are usually considered as “small” in some 
sense. 

Examples 1. X={0}isa cr-ideal on every set X. 

2. For every A C X the family X = V(A) forms a cr-ideal on X. In the 
case when A = X we obtain X = V(X). This ideal does not agree with our 
intuitive notion of a family of small sets. Thus, usually we will work with 
the proper ideals on X, that is, the ideals that are not equal to V{X). 

3. For every infinite cardinal number n and every set X the family X = 
[A] <K is an ideal on X. For n = u> this is the ideal of finite subsets of X. 
For cf(ft) > w it is also a cr-ideal. Notice also that for k = uq this is the 
ideal [A]- w of at most countable subsets of X. 

4. For every topological space (X, r) the family AfV of all nowhere-dense 
subsets of X (i.e. , the subsets S of X such that int(cl(S)) = 0) is an ideal 
on X. 

5. For a topological space (X. t) the family 

A4 = < |^J Ak : Ak is nowhere dense in X 

V k<uj 

is a cr-ideal on X. If X = R" (or, more generally, A is a complete metric 
space or a compact space) then, by the Baire category theorem, M is 
proper. If moreover X does not have any isolated points, then [A']- w C AT 
The ideal A4 is usually called the ideal of Meager (or first- category) subsets 
of A. 

6. For n-dimensional Euclidean space R" we say that A c R" is a 
(Lebesgue) measure-zero set or a null set in R™ if for every e > 0 there 
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is a family of open balls {B(xk, r*) : k < u} such that X C Ufe<^ B(xk, rtf) 
and J2k<w r k < £ - The family 

Af = {X C IT : X is a null set in R”} 

forms a proper tr-ideal in R" called the a-ideal of (Lebesgue) measure-zero 
sets or null sets. 

The ideals A4 and A f on R n are of main interest in real analysis. It is 
easy to see that every countable set belongs to both M and Af. Also, for 
n = 1, the Cantor set C (see Section 6.2) is in A4CiAf. It belongs to M since 
it is closed and does not contain any nonempty open interval. It belongs 
to Af since the total length of intervals forming C n D C is 2”^ = (|) . 

In what follows we will study cr-ideals A4 and A f from the set-theoretic 
point of view. It is worthwhile to notice, however, that both these ide- 
als measure “smallness” in a very different sense. Ideal A4 describes the 
smallness in a topological sense whereas Af does so in a measure sense. The 
difference between these two “smallness” notions is best captured by the 
following example. 

Proposition 6.3.1 There exists a dense G$ set G C R" such that G £ Af 
and R" \G £ A4. 

Proof Recall that Q™ is a countable dense subset of R". Let {qk : k < u>} 
be an enumeration of Q". For in < uj let 

G m = |J B(q k , 2-( m+fe )/™) 

k<fuj 


and let G = f| m<w G m . 

G £ Af since for every e > 0 there exists an m < u> with < e 

and 

GcG m = |J B( % ,2-( m + fc )/ n ), 

k<uJ 

whn e £fc<a, [2-( m+fc )/"] n = 2 _ ( m_1 ) < e. 

To see that R n \ G £ A4 notice that 

R n \ G = R” \ f| G m = |J (R" \ G m ) 

m<oj m<uj 


and each of the sets R™\G m is closed and nowhere dense, since G m is open 
and dense. □ 
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To continue our journey through the special subsets of R" we need the 
following definitions and constructions. For a cr-ideal X on a set A' define 
a binary relation ~x on V(X) by 

A ~x B AAB £ X. 

Notice the following easy facts. 

Proposition 6.3.2 LetX be a a-ideal on X. 

(i) For every A, B c X, 

A ~x B if and only if A \ D = B \ D for some D £ X. 

(ii) ~x is an equivalence relation on V(X). 

(iii) If A ~x B then X \A~ x X\B. 

(iv) If A n ~x B n for every n < w then U„< w A n ~z U„< w B n- 

Proof (i) If A ~x B then D = AAB £ X and A\D = B\D. Conversely, 
if A \ D = B \ D for some D £ X then AAB C D £X. 

(ii) It is easy to see that A ~x A and that A ~x B implies B ~x A for 
every A,Bc X. Now, if A ~x B and B ~x C then A\D = B\D and 
B\E = C\E for some D,E £X. So A\ (DUE) = B\(DUE) = C\(DUE) 
and D U E £ X. So A ~x C. 

(iii) If A ~x B then A \ D = B \ D for some D £ X and so (A \A)\D = 
(A \B)\ D. Hence A \ A ~ x X \ B. 

(iv) Let A ni B n c A be such that A n ~x B n for every n < to and let 
D n Gibe such that A n \D n = B n \D n for n < to. Then D = (j n<u> D n £ X 
and so U„< w A n \D = U„<„ B n \ D. Therefore U„ <aJ A n ~x [J n<u] B n . □ 

Theorem 6.3.3 LetX be a cr-ideal on X and let A C V(X) be nonempty. 
The family 

A[I\ = {AAD: A£ Ah D £X} 
forms a a-algebra on X if and only if 

(1) A \ A £ A\I\ for every A £ A, and 

(2) Un<aj A n G A[X\ provided A n £ A for every n < ui. 

In particular, if A is a a-algebra on X then A\I\ is a a-algebra generated 
by A U X. 



6.3 Lebesgue-measurable sets and sets with the Baire property 


101 


Proof The implication => as well as the additional part of the theorem 
are obvious. To prove the other implication we have to prove that A\I\ is 
closed under complements and under countable unions. First notice that 

C~ T Bh BeA[l\ => C£A[X], (6.10) 

since B £ A[T\ implies that B ~x A for some A £ A, so C ~x A and 
C = AAD £ A[l\, where D = AAC £ 1. 

Now, if B = AAD for some A £ A and D £ X then BAA £ X and, by 
Proposition 6.3.1(iii), X\B ~x X\A. But, by our assumption, X\A £ A\I\ 
and so, by (6.10), X \ B £ A\I]. 

Similarly, if B n = A n AD n for some A n £ A and D n £ X then B n AA. n £ 
X and, by Proposition 6.3.1(iv), [j n<u; B n ~x U„< w A n . Since U n<u , A n G 
A\I\, condition (6.10) implies that U„< w B n G A[X\. □ 

The most important cr-algebras on 1" generated as in Theorem 6.3.3 
are the cr-algebras C = Bor[Af] of Lebesgue-measurable sets and the <r- 
algebra Baire = Bor[Xt] of sets with the Baire property, where Bor stands 
for the cr-algebra of Borel subsets of R n . Both these families are very rich. 
In fact, without use of the axiom of choice we cannot prove the existence 
of a subset of K™ that does not have the Baire property. A similar theorem 
for the family C of all measurable sets can also be proved. 

In what follows we prove that there is a subset of 1" that is nonmea- 
surable and does not have the Baire property. For this, however, we need 
some structural theorems about Baire and C. 

Theorem 6.3.4 Baire = t[M\ = {U AM : U is open in R" and M £ M}. 

Proof This follows immediately from Theorem 6.3.3 since a union of open 
sets is open and since for every open set A its complement F = R” \ A 
is closed, and F = int(F) U (F \ int(F)), where F \ int(P) is closed and 
nowhere dense. □ 

The similar characterization for C is more complicated and is given by 
the next theorem. 

Theorem 6.3.5 For every A £ C there exists an F a set F and a Gs set 
G such that F c A c G and G\F £ A f. 

In particular, 

C = {GAN : G is Gg in M" and N £ Al} 

= {FAN: F is F a in 1" and N £ A I}. 

The property described in this theorem is called the regularity of the 
family C and is a basic fact about Lebesgue measure. We will leave it here 
without proof (see, e.g., Royden 1988). 
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Theorem 6.3.6 If A £ C\Af or A £ Baire\M then there exists a perfect 
set P such that P C A. 

Proof First assume that A £ C\Af. Then, by Theorem 6.3.5, there exists 
an F a set F such that F C A and A \ F £ A f. Thus F £ C\Af . Let 
F = U n <u ^n> where the sets F n are closed in R”. Notice that at least one 
of the sets F n must be uncountable, since otherwise F would be at most 
countable and then it would belong to Af . So let F n be uncountable. Then, 
by the Cantor-Bendixson theorem (Theorem 6.2.4), there is a perfect set 
P C F n C F C A. 

Now assume that A £ Bair e\ Ad and let A = UAS for some nonempty 
open set U and S £ A4. Let S = (J n < w S n for some S n £ AfV. 

As in Theorem 6.2.3 we will use a “tree-construction” argument to 
define a family {U s C U: s £ 2 <u } of nonempty open sets; that is, the 
construction will be done by induction on the length |s| of a sequence s. 

We choose a bounded U% such that cl(E/ 0 ) C U \ Sq and continue the 
induction, maintaining the following condition to be satisfied for every 
s £ 2<“: 

(I) U s q and Usi are open balls such that 

d(£/ao) n cl(C/ s i) = 0, cl (U s0 ) u cl(£/ sl ) c U s \ S M . 

To make an inductive step assume that U s is already constructed. Since 
cl(5| s |) £ AfV we have U s \ cl(S| s |) ^ 0 and we can find two open disjoint 
balls in XJ S \ cl(5| s |). Decreasing their radii, if necessary, we can satisfy 
condition (I). This completes the construction. 

Now let F n = U{cl(C/g): s <E 2 < “, |s| = n} and F = f| n<u F n- By 

our construction, F C U and F n fi S n = 0 for every n < to. Hence, 
F C U\S C A. Also, each of the sets F n is compact, being a finite union of 
closed bounded sets. Thus F is compact. To finish the proof it is enough to 
show that F is uncountable, since then, by the Cantor-Bendixson theorem, 
there is a perfect set P C F C A. 

To see this, for every / € 2 W consider a set Ff = C\ n<u cl(U f\ n ) C 
F. Notice that, by Theorem 3.3.1, each set Ff is nonempty. Note also 
that the sets {Ff. f £ 2“} are pairwise disjoint. Indeed, if /, g £ 2“ 
are different, n = min{* < lo: f(i ) ^ ^(i)}, and s = f\ n = g\ n then 
Ff fl F g C cl(H s o) n cl(t/ s i) = 0. Now let c: — > F be a choice function 
for the family {Ff : f £ 2“}, that is, c(f) £ Ff for every f £ 2“. Then c is 
one-to-one and so |2“| < \F\, proving that F is indeed uncountable. 

This finishes the proof. □ 

We conclude this section with the following two theorems. 
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Theorem 6.3.7 There exists a set B C R” such that neither B nor W l \B 
contains any perfect subset. 

Proof First notice that the family T of all perfect subsets of R n has 
cardinality < c, since T C Bor and |Sor| = c. 

Let T = {P^ : f < c}. By transfinite induction on £ < c define the 
sequences (a^ : £ < c) and (b^ : £ < c) by choosing in step £ < c points 
^ b ^ from R" such that 

A e P $ \ ({a c : C < C}U {6 f : C < £})• 

This can be done since 

|{a c : C<£}U A: <<£}| = 2® l£| < C 

and, by Theorem 6.2.3, \P^\ = c. This finishes the construction. 

Now define B = { a £ < c}. Then for every perfect set P there is a 
£ < c such that P = P^. Hence € P D B and b$ £ P fl (R™ \ B). Thus 
P qL B and P <£_ R" \ B. □ 

The set B from Theorem 6.3.7 is called a Bernstein set. 

Theorem 6.3.8 A Bernstein subset B of R" neither is measurable nor 
has the Baire property. 

Proof To obtain a contradiction assume that B £ C. Then, by Theo- 
rem 6.3.6, B £ A f. Similarly, R” \ B £ C and R" \ B £ A f. But then 
R™ = B U (R ra \ B) £ A f, which is false. So B C. 

The proof that B Baire is exactly the same. □ 

EXERCISES 

1 Show that there exists a Bernstein set B such that B + B = R, where 
B + B = {&o + b\ : b\ £ B }. 


2 Show that there exists a Bernstein set B such that B + B is also a 
Bernstein set. 



Chapter 7 


Strange 


real functions 


This chapter is designed to help the reader to master the technique of 
recursive definitions. Thus, most of the examples presented will involve 
constructions by transfinite induction. 


7.1 Measurable and nonmeasurable functions 

Let B be a tr-algebra on R n . A function /: R™ — > R is said to be a 6- 
measurable function if / _1 ({ 7) £ B for every open set UcR. Notice that 
if / is ,8-measurable then f^ 1 (B) £ B for every Borel set B CR. This is 
the case since the family {B C R: f~ 1 (B) £ B} is a a-algebra containing 
all open sets. 

We will use this notion mainly for the cr-algebras of Borel, Lebesgue- 
measurable, and Baire subsets of R", respectively. In each of these cases 
,6-measurable functions will be termed, respectively, as Borel functions (or 
Borel-measurable functions), measurable functions (or Lebesgue-measurable 
functions ), and Baire functions (or Baire-measurable functions). Clearly, 
every continuous function is Borel-measurable and every Borel-measurable 
function is measurable and Baire. 

A function /: R" — > R is non-Borel (or non-Borel-measurable) if it 
is not Borel. Similarly, we define non-Baire(-measurable) functions and 
non- (Lebesgue-) measurable functions. 

Also recall that the characteristic function Xa of a subset A of a set X 
is defined by putting Xa(x) = 1 if x £ A and Xa(x) = 0 for x £ X \ A. 

The first theorem is a corollary to Theorems 6.3.7 and 6.3.8. 

Theorem 7.1.1 The characteristic function Xb of a Bernstein subset B 
of R" is neither a measurable nor a Baire function. In particular, there 
exists a nonmeasurable, non-Baire function from R" to R. 
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Proof The set ( Xb ) 1 ({1}) = B neither is measurable nor has the Baire 
property. □ 

Let us recall that the Fubini-Tonelli theorem says that for a measurable 
function / : R 2 — > R that is either integrable or nonnegative we have 



fx dm I 


dm i 



dm \ , 


where m 2 and mi stand for the Lebesgue measures on R 2 and R, respec- 
tively, and f x ,f v : R — ► R are defined by f x (y) = f(x,y) and f v (x) = 
f(x,y). The integrals / (/ f x dm. 1 ) dm\ and J (f f v dmi) dm\ are called 
iterated integrals. Thus, the Fubini-Tonelli theorem tells us for measurable 
functions what for the continuous functions is taught in every multivariate 
calculus course: A two-dimensional integral is equal to both iterated inte- 
grals. But what if the function / is nonmeasurable? Then we can’t talk 
about the integral // / dm 2 . However, it is still possible that both iterated 
integrals exist. Must they be equal? The next theorem, due to Sierpinski 
(1920), gives a negative answer to this question. 

Theorem 7.1.2 If the continuum hypothesis holds then there exists a func- 
tion f: [0, l] 2 — » [0,1] such that J f x dmi = 1 and f f v dm\ = 0 for all 
x,y £ [0,1]. In particular, 


/ 



dm\ = 1 


and 



dmi = 0 . 


Proof Let A be as in Theorem 6.1.8, put B = A fl [0, 1] 2 , and define 
/ = Xb- Notice that \B y \ < u> and | ( [0, l] 2 \ B) x \ < u> for all x,y £ [0, 1]. 
Thus f y {x) = 0 for all but countably many x £ R, and so / f v d.rn-\ = 
f Odmi = 0. Similarly, for every x € [0, 1] we have f x (y) = 1 for all but 
countably many y £ [0,1]. Thus J f x dm\ = J, Q ^ 1 dm.\ = 1. □ 

A set B C [0, l ] 2 for which the function / = Xb satisfies Theorem 7.1.2 
is usually called a 0-1 set. Its existence is not equivalent to the continuum 
hypothesis; that is, it might exist when the continuum hypothesis fails. 
On the other hand, the existence of the function / from Theorem 7.1.2 
cannot be proved in ZFC, which has been shown by H. Friedman (1980) 
and, independently, by C. Freiling (1986). 

EXERCISE 

1 Prove that there exists a function /: R — » R such that / -1 (r) is a 
Bernstein set for every r £ R. 
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7.2 Darboux functions 

To motivate what follows let us recall one of the most fundamental theo- 
rems of calculus - the intermediate value theorem. It tells us that every 
continuous function / : R — > R has the following property: 

(DP) For every a < b and every number y between /(a) and f(b) there is 
an x € (a, b) such that f(x) = y. 

The functions satisfying property (DP) form an important class of functions 
known as Darboux functions. In particular, we say that a function / : R — > 
R has the Darboux property (or is a Darboux function) if it satisfies property 
(DP). Thus, the intermediate value theorem says that every continuous 
function / : R — » R has the Darboux property. 

Not every Darboux function, however, is continuous. For example, the 
function f(x) = sin(l/x) for x yf 0 and /( 0) = 0 is a discontinuous Dar- 
boux function. The next theorem tells us that there are also everywhere- 
discontinuous Darboux functions (that is, Darboux functions that are dis- 
continuous at every point). For this we will need one more definition. We 
say that a function / : R — > R is strongly Darboux if 

(SD) for every a < b and every number y there is an x € ( a , b ) such that 

f(x) = y- 

Thus / is strongly Darboux if /[(a, 6)] = R for every a < b. Clearly, every 
strongly Darboux function is Darboux and everywhere discontinuous. 

Theorem 7.2.1 There exists a strongly Darboux function f: R — > R. In 
particular, f is Darboux and everywhere discontinuous. 

Proof The set A constructed in Theorem 6.1.1 is the graph of the desired 
function /. □ 

The next theorem is a generalization of Theorem 7.2.1. 

Theorem 7.2.2 Let Q be a family of real functions, Q C R R , with \Q\ < c. 
Then there exists a function f : R — » R such that f + g is strongly Darboux 
for every g £ Q . 

Proof Let T = {(g, I, r) : g € Q & r € R & / = (a, b) for some a < b}. 
Then \T\ = c, since c = |R| < | T\ < \Q\ <8> |R 3 | = c. Let {(g^,I^,r^): £ < c} 
be an enumeration of family T . By transfinite induction define a sequence 
{a;^ : £ < c} such that 

H e k\{ x ( : C <£}• 

Such a choice can be made since \I^\ = |R| = c > |£| > |{xf : £ < £}| for 
every £ < c. 
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Define f(x^) = rj — g^{x^) and extend it to all of R arbitrarily. Notice 
that it has the desired property, since for every nonempty interval ( a,b ), 
every r G R, and every g G Q there exists a £ < c such that (g%, ry) = 
(g, ( a,b),r ), and so 

r = r i = f(x i ) + gz(xz) G (/ + ^)[/j] = (/ + s)[(a, 6)]. □ 

Notice that Theorem 7.2.2 generalizes Theorem 7.2.1, since we can as- 
sume that the function that is identically zero belongs to Q. In particular, 
the function / in Theorem 7.2.2 can be chosen to be strongly Darboux. 

Corollary 7.2.3 Let Q be a family of real functions such that \Q\ < c. 
Then there exists a (strongly) Darboux function f: R — > R such that f + g 
is (strongly) Darboux for every g G Q . 

Another way to look at Corollary 7.2.3 is to use the language of cardinal 
functions as follows. Let V stand for the family of all Darboux functions 
from R to R and let A(D) be the minimal cardinality of a family Q for 
which Corollary 7.2.3 fails, that is, 

A(V) = min{|C?| : V/ G R 1 3g G Q (/ + g £ V)}. 

Notice that the set {\G \ : V/ G R R 3g G Q (/ + g & 7))} is not empty, 
since |R R | = c c = 2 C belongs to this set: For every / G R R there exists a 
g G G = R R such that / + g is equal to any fixed function h, and such 
that h can be chosen to be not Darboux. So A(D) < 2 C . In this language 
Corollary 7.2.3 can be stated as follows: 

c < A(V) < 2 C . (7.1) 

If 2 C = c+, which follows from the generalized continuum hypothesis, then 
A(D) = 2 C . However, this equation cannot be proved in ZFC alone. It 
has been proved by K. Ciesielski and A. W. Miller (1994-5) that (7.1) is 
essentially everything that can be proved in ZFC about A(V). 

In what follows we will consider some other generalizations of Theo- 
rem 7.2.1. We start by noticing that the sum of a Darboux function and 
a continuous function does not have to be Darboux. To see it, modify the 
proof of Theorem 7.2.2 to get a strongly Darboux function / such that 
g(x) = f(x) — x is not Darboux, by choosing 

xt&h\ ({^c • C < £}U{rJ) 

and defining f(x^) = r%. Then f(x^) ^ x % and we can extend / to all of 
R to have f(x) ^ x for all iGl. So g(x) = f(x) — x ^ 0 for every igR 
and Proposition 7.2.4, stated next, implies that g is not Darboux. 
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Proposition 7.2.4 If f is strongly Darboux, then for every continuous 
function g, 


f + g is strongly Darboux iff f + g is Darboux. 

Proof Implication => is obvious. The other implication follows from the 
fact that on any interval I = ( c,d ), with c < d, the continuous function 
g is bounded, while /[/] = R. So f + g must be unbounded from below 
and from above. In particular, for every y £ R there exist A < y < B and 
a,b £ I such that (/ + g)(a) = A and (/ + g)(b) = B. Thus there exists 
an x between a and b with (/ + g)(x) = y, since f + g is Darboux. Hence 

(f + g)[I] = R. □ 

The next theorem, due to Kirchheim and Natkaniec (1990-1), gener- 
alizes the preceding observation. It concerns continuous nowhere-constant 
functions, that is, continuous functions that are not constant on any open 
nonempty interval. 

Theorem 7.2.5 If the continuum hypothesis holds then there is a strongly 
Darboux function f such that f + h is not Darboux for every continuous 
nowhere-constant function h. 

Proof Let {h^: £ < c} be an enumeration of all continuous nowhere- 
constant functions from R to R. Notice that for any such function h the 
set d _1 (r) is closed and nowhere dense for every r £ R. 

By Proposition 7.2.4 it is enough to find /: R — > R such that 

V£ < c £ R Vx £ R (f(x) + hffx) ^ rj). (7.2) 

For this we will construct sequences {aif £ R: £ < c}, {y% € R: £ < c}, and 
{r^ el: £ < c} such that the function / defined by 

/(£«) = Vi (7-3) 

is strongly Darboux and satisfies condition (7.2). As we know from Theo- 
rems 7.2.1 and 7.2.2, it is relatively easy to construct the sequences {x<} 
and { y {} such that / defined by (7.3) is strongly Darboux. In previous 
constructions, however, we did not have to worry about {a:^ : £ < c} be- 
ing equal to R, since any extension of the part of / given by (7.3) was 
still strongly Darboux. But this time, we also have to take care of condi- 
tion (7.2), and the extension may fail to have this property. To avoid this 
problem we will arrange our construction to have 

{cc ? :£<c} = R. (7.4) 

Now let S and T be subsets of c of cardinality c. Let {sj : £ £ S} be an 
enumeration of R and let {(/j, t %) : £ G 7} be an enumeration of the family 
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T = {(/, t ) : t € R. & I = (a, b) for some a < b}. To satisfy (7.4) it is 
enough to have 

(SI) X£ = S£ for every (e S. 

To make sure that / is strongly Darboux it is enough to proceed as in 
Theorems 7.2.1 and 7.2.2; that is, for every £ £ T choose 

H e h \ C < £} (7-5) 


and put 

(Tl) % = i£. 

Notice, however, that for ^ I $ conditions (SI) and (7.5) cannot be 
satisfied simultaneously. To avoid this problem we will assume that the 
sets S and T are disjoint. We can also assume that S LIT = c. 

Property (7.2) will hold if we construct the sequences in such a way 
that for every £ < c 


VC < < £ (yr, + h c (x v ) ^ r f ). (7.6) 

To preserve this condition while constructing x^, and we will have 
to choose such that y v + h^( x v ) ^ rj for every y < £, that is, pick 

(*) \ {y v + h^(x v ) :?)<£}, 

and choose x % and yc such that 

j/ ? + /i c (x 5 ) ^ r c for every C < £• (7-7) 

Now for ( 6 T we have no freedom in choosing yc since, by (Tl), 
j/£ = fj. But to satisfy (7.7) it is enough to choose x £ ^ (Jc<£ — %)• 

Combining this with (7.5) we need only choose 

(T2) X£ G \ ({ai c : C < £} U Vc “ %) : C < £}) for every (eT. 

For ( € S, by (SI), we have a;^ = sj. If x £ = a:^ for some C < £ then, by 
(7.3), we have to define 

(S2) = y c - 

But then conditions (7.6) and (7.7) will be preserved. On the other hand, 
if x^ x ^ for every £ < £, i n order to maintain (7.7) it is enough to choose 


(S3) % G R\ {r v - h v {x^): rj < £}. 
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The foregoing discussion shows that the function f can be constructed 
if we can construct the sequences {a;^ € R: £ < c}, {yj G R: £ < c}, and 
{r^ el: £ < c} such that the following conditions hold for every £ < c: 


M 

r ? 

€ R \ {Uri + h^iXri) ■ 

v < ?}; 



(Tl) 

Vi 

= for £ £ T; 




(T2) 

x z 

£ h\ (w : C < £} 

ulK^cVc - 

- 22?) : C < 

£}) for £ G T; 

(SI) 


= sj for £ £ 5; 




(S2) 

22? 

= for £ £ S if x $ 

= x £ for some £ < £; 


(S3) 

22? 

G R \ {r c - h c (x^): 

£ < £} for £ ? 

E S if X£ t? for all £ < £ 


By the recursion theorem it is enough just to argue that such a choice is 
always possible. 

The possibility of the choices in (★), (Tl), (SI), (S2), and (S3) is obvi- 
ous. The choice of x £ in (T2) is possible by the Baire category theorem, 
since the setja^: £ < C}b | U{^( :1 ( r C — 2/?) : C < £} is of first category, being 
an at most countable union of nowhere-dense sets (this is the place where 
we used the continuum hypothesis!). □ 

It is worthwhile to mention that the conclusion of Theorem 7.2.5 is not 
equivalent to the continuum hypothesis. Essentially the same proof works 
with a weaker additional set-theoretic axiom, known as the additivity of 
category, that the union of less than continuum many meager sets is still 
a meager set (see Theorem 8.2.6). However, it has been recently proved 
by Steprans (1993) that the conclusion of Theorem 7.2.5 is independent of 
ZFC set theory. 


EXERCISES 

1 Show that every function / : C — > R on a nowhere-dense subset C of R 
can be extended to a Darboux function g: R — > R. 

Since there are nowhere-dense subsets C of R of cardinality c, for ex- 
ample, the Cantor set, conclude that the family V of Darboux functions 
has cardinality 2 C . 

2 Generalize Theorem 7.2.5 to prove the following: 

Let Q be the family of all continuous functions from R to R such 
that for every h in Q there exists a nonempty open interval / 
such that h is not constant on any nonempty open subinterval 
of /. If the continuum hypothesis holds then there exists a 
strongly Darboux function / such that f + h is not Darboux 
for every h £ Q . 
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3 Show that if h is a continuous function with the property that every 
nonempty open interval contains a nonempty open subinterval on which 
h is constant then f + h is strongly Darboux for every strongly Darboux 
function /. 

Remark: There are continuous nonconstant functions with the property 
just described. For example, if C is the Cantor set then we can define a 
function h: [0, 1] — » R by h(x) = x n /2 n+1 , where Y^~i x n/ 3 n = 

max(C fl [0, x\) and x n € {0, 2}. 

4 Find uncountable sets X,Y C R \ Q such that for every continuous 
function / : R — > R if /[X] C Y then / is constant. 

5 Prove the following theorem of Sierpinski and Zygmund: There exists a 
function / : R — > R such that for every continuous function g : R — » R the 
set {x € R: f(x) = g(x)} has cardinality less than c. Hint: Use the fact 
that every continuous function h : X — » R, with Icl, has a continuous 
extension h: G — > R to a Gg set Gel. (This can be proved by noticing 
that the set of all z € R for which the limit lim x ^ z h(x) does not exist is 
an F a set.) 

7.3 Additive functions and Hamel bases 

A function F : R — » R is an additive function if F(x + y) = F(x) + F(y) 
for every Every linear function F{x) = ax is clearly additive and 

continuous, and it is quite easy to see that these are the only continuous 
additive functions. However, there exist discontinuous additive functions. 
The first example of such a function was constructed by Hamel, with the 
use of a Hamel basis, that is, a linear basis of R over Q, which exists by 
Theorem 4.4.1. For its construction we will need some easy facts. 

For B C R let LINq(H) denote the smallest linear subspace of R over 
Q containing B. Notice that LINq(B) can be obtained by closing set B 
under the action of the family T of operations (x, y) i— > x + y and x > qx 
for every q £ Q. Thus LINq(B) = cl p(B), where we are using the notation 
of Lemma 6.1.6. In particular, Lemma 6.1.6(a) implies that 

|LINq(H)| = \B\ + u> for every BcK. (7.8) 

From this we immediately conclude that 

every Hamel basis H has cardinality continuum (7-9) 


since LINq(IL) = R. 
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Proposition 7.3.1 If F: R — > R is additive then it is linear over Q, that 
is, 

F(px + qy ) = pF (x) + qF(y) 
for every p,q £ Q and x,y £ R. 

Proof Let F: R — > R be additive. Then F(0) + F(0) = F( 0 + 0) = F(0), 
so 

F(0) = 0. 

It is easy to prove by induction on n £ ui that F(nx) = nF{x) for every 
x £ R. Since F{—x) + F(x) = F(—x + x) = F( 0) = 0, we have also that 
F(—x) = —F(x). Therefore we conclude that 

F(nx) = nF{x) for every iff and n £ Z. 

In particular, for 0 < m < u we have mF (-^x) = F (m^x) = F(x), that 

F(px) = pF{x) for every itl and p £ Q, 

and F(px + qy) = F(px) + F{qy) = pF{x) + qF(y) for every p, q £ Q and 
x, y £ R. □ 

Theorem 7.3.2 If H C K. is a Hamel basis then every function f : H — > R 
can be extended uniquely to an additive function F: R — > R. 

Proof Since every i£l has a unique representation x = qib± + - ■ ■+q r nb m 
in the basis H (that is, b\ < ■ ■ ■ < b m are from H and q -\ , . . . ,q m £ Q\ {0}; 
see Section 4.4), we can define 

m 

F(x) = '£q i f(b i ). (7.10) 

i = 1 

This function is clearly additive and extends /. Its uniqueness follows from 
Proposition 7.3.1. □ 

Corollary 7.3.3 There are additive discontinuous functions F: R — ■> Q. 

Proof Let H be a Hamel basis and let x £ H. Define /: H — > R by 
putting /( x) = 1 and f(y) = 0 for y £ H \ {x}. Extend / to the additive 
function F. By (7.10) the range of / is equal to Q. Thus, F cannot be 
continuous, since it does not have the Darboux property. □ 

The function from Corollary 7.3.3 has a small (countable) range. It is 
not difficult to modify the argument from Corollary 7.3.3 to get an additive 
discontinuous function onto R. The next theorem shows that we can even 
do a lot better than this. 
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Theorem 7.3.4 There exists a Hamel basis H and a function f : H — > M 
such that / _1 (r) is a Bernstein set for every r € R. 

Proof First notice that if / : H — > R is such that 

f~ 1 (r) flP^0 for every r € R and perfect set P Cl 

then / has the desired property, since the complement of each / _1 (r) con- 
tains / _1 (r + 1) and so it must also intersect every perfect set. 

Let {(P{, r^) : £ < c} be an enumeration of {P C R: P is perfect} x R. 
We will construct, by induction on £ < c, a one-to-one sequence {a^ : £ < c} 
that is linearly independent over Q and such that x j € Pj for every £ < c. 

Notice that this will be enough to construct function /. To see it, extend 
{x^: £ < c} to a Hamel basis (see Theorem 4.4.1). Define /(x {) = rj for 
£ < c and extend f arbitrarily to H. Then for every perfect set P C M 
and every r € R there exists a £ < c such that (P, r) = (Pj,r^), so that 
/(x = r^ = r and x^ € P^ = P. Therefore / _1 (r) fl P yf 0. 

To make the sequence {x^ : £ < c} linearly independent over Q for every 
(<cwe will choose 

(i) x ( eP ? \LIN Q ({x c : C<£»- 

The choice is possible since, by (7.8), |LINq({x^: £ < £})| = |£| + to < 
c. Thus, by the recursion theorem, we can find a sequence satisfying (i). 
To finish the proof it is enough to show that the choice of x^s from the 
complement of LINq({x^: ( < £}) makes the sequence { x £ < c} linearly 
independent over Q. This part is left as an exercise. □ 

Corollary 7.3.5 There exists a nonmeasurable non-Baire additive func- 
tion F : R — > R that is strongly Darboux. 

Proof Let F be an additive extension of / from Theorem 7.3.4. Then 
P -1 (r) is a Bernstein set for every r £ R. □ 

In fact it can be proved that every discontinuous additive function is 
neither measurable nor Baire. 

Corollary 7.3.6 There exists a Hamel basis H that is neither measurable 
nor has the Baire property. 

Proof It is enough to notice that the Hamel basis H from Theorem 7.3.4 
is a Bernstein set. 

It clearly intersects every perfect set. On the other hand, if a € H, then 
a + H = {a + x: x £ H} is disjoint from H. This is the case since otherwise 
there would exist x,y € H such that a + x = y and so {a, x,y} C H would 



114 


7 Strange real functions 


be linearly dependent. Thus a + H C R \ H. But a + H intersects every 
perfect set P, since Hn(— a+P) yf 0 for every perfect set P, the set — a+P 
being perfect. □ 

Notice that there are Hamel bases that are measurable and have the 
Baire property (see Exercise 2). 

Corollary 7.3.7 There exists a Hamel basis H such that \H n P| = c for 
every perfect set P. 

Proof It is clear that the basis H from Theorem 7.3.4 has the desired 
property, since HC\P = Ur6R(/ (r) n P) is the union of continuum many 
nonempty disjoint sets. □ 

Next, we will construct an additive function P whose graph is connected 
as a subset of the plane. The graph of such a function is called a Jones 
space. 

Theorem 7.3.8 There exists a discontinuous additive function F: R — > R 
whose graph is connected. 

Proof Let H be the Hamel basis from Corollary 7.3.7 and let T be 
the family of all closed subsets P of the plane such that p[P\ contains 
a perfect set, where p is the projection of the plane onto the x-axis. Let 
{P^: £ < c} be an enumeration of T. Define, by induction on £ < c, a 
sequence {(x^,y^) € R 2 : £ < c} such that 

(i) G (Pnp[P ? ]) \{;r c : < < £}, 

(ii) (^,%)ePj. 

The choice as in (i) can be made, since | H flp[P^]| = c. 

Now define f(x^) = y £ and extend / to H in an arbitrary way. Let P be 
the additive extension of /. We will show that the graph of P is connected. 

To obtain a contradiction assume that the graph of F is not connected. 
Then there exist disjoint open subsets U and V of R 2 such that U fl F 
0 / V flF and P C U U V. Let P = R 2 \ (U U V). We will show that 
P fl F yf 0, which will give us the desired contradiction. 

If P € T then there exists a £ < c such that P = P{, and so (x^,y^) G 
Pj fl P = P fl P. Thus it is enough to prove that P € T . Clearly, P is 
closed. So we have to show only that 

p[P] contains a perfect subset. 

In order to prove it we first show that there exists an x € R such that 

Pn({i}xR)/0/yn({4xR). ( 7 . 11 ) 
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To obtain a contradiction, assume that (7.11) is false. Then the sets U\ = 
p[U] and V\ = p[V ] are disjoint. But they are open in R, since the projection 
of an open set is open. They are clearly nonempty, since U and V are 
nonempty. Finally, U\UVi = p[U Ub] D p[F] = R, that is, sets U\ and V\ 
violate the connectedness of R. Condition (7.11) has been proved. 

Now let xq € R be such that U fl ({a;o} x R) / 0 / V fl ({^o} x 
and let j/ 1 , 2/2 G R be such that (xo,yi) G U and ( £ 0 , 2 / 2 ) G V. Then there 
exists an open interval I = ( a,b ) containing x$ such that / x { j/i } C U 
and / x {y 2 } C V. But for every x G I the interval J x = {x} x [2/1 , 2/2] 
is a connected set intersecting both U and V. Thus J x must intersect P 
as well, since it is connected. Therefore {x} = p[J x fl P] C p[P] for every 
x G I, that is, p[P\ D I. This finishes the proof that P G T . □ 

We will finish this section with following theorem of Erdos and Kakutani 
(1943). 

Theorem 7.3.9 The continuum hypothesis is equivalent to the existence 
of a countable partition { H n : n < co} of R \ {0} such that every H n is a 
Hamel basis. 

In the proof of the theorem we will use the following two lemmas, the 
first of which is due to Erdos and Hajnal (Erdos 1978-9). They are also 
very interesting in their own right. 

Lemma 7.3.10 If f : A x B — > to, where |T| = u > 2 and \B\ = u> 1 , then for 
every n < to there exist Bq g [B] n and Aq g [A]“ 2 such that /(ao,&o) = 
/(ai,&i) for every ao,a\ G A 0 and b 0 ,bi G Bo- 

Proof Fix n < u). First we will show that 

VaGd 3B a G [B] n 3m a < u> V6 G B a (/(a, b ) = m a ). (7-12) 

To see this, notice that for every a £ A the sets S'™ = {b G B : /(a, b) = m} 
form a countable partition of an uncountable set B. Thus there is an 
m = m a for which the set S™ is uncountable and any n-element subset B a 
of S™ satisfies (7.12). 

Now let F: A —> [B]" x w be such that for every a £ A the pair 
F(a) = { B a ,m a ) satisfies (7.12). The set [B] n x u> has cardinality uj\, so 
there exists (B 0 , m) G [B] n x to such that A 0 = F~ 1 (B 0 , m) has cardinality 
u> 2 - But then, if a,; G Aq for i < 2 then F{af) = (Bo,m) and 

f(a>i,b) = m a,i = m 

for every b G B ai = B 0 . So the sets A 0 and B 0 have the desired property. 

□ 
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Notice that Lemma 7.3.10 can also be expressed in the following graph- 
theoretic language. Let A and B be disjoint sets of cardinality u >2 and uji, 
respectively, and let G = {V, E) be the bipartite graph between A and B, 
that is, V = A U B and E = {{a, b}: a € A & b € B}. Then for any n < oj 
and any coloring /: E — » oj of the edges of graph G there are A 0 £ [H]“ 2 
and B 0 £ [B] n such that the subgraph Go = (H 0 U B 0 , E n V(A 0 U B 0 )) of 
G generated by Ao U Bo is monochromatic, that is, the coloring function / 
is constant on the edges of Go- 

Lemma 7.3.11 Assume that c > oj\ and let H £ [R]“ 2 . Then for every 
partition { H n : n < uj} o/R there exist n < oj and disjoint sets Aq £ [H] U2 
and Bq £ [H] 2 such that 

a + b £ H n for every a £ Ao and b £ Bq- 

Proof Choose disjoint A £ [ff]^ 2 and B £ [H] Ul . Define /:ix5-iw 
by 

f(a, b) = m if and only if a + b £ H rn . 

Then the sets Ao and Bo from Lemma 7.3.10 used with / and n = 2 have 
the desired properties. □ 

Proof of Theorem 7.3.9 <=: Let H be any Hamel basis and suppose that 
there exists a partition {H n : n < oj} of R \ {0} into Hamel bases. 

To obtain a contradiction assume that c > oj\. Then \H\ = c > U 2 so 
we can choose Ho and Bo as in Lemma 7.3.11. Take different ao,a± £ Ao 
and bo,bi G Bo- Then the numbers x-ij = Oj + bj for i. j £ 2 are different 
and belong to the same H n . However, 


xoo ~ xio = (a 0 + bo) - (a i + b 0 ) = (a 0 + h) - (ai + h) = x 0 i - Xn , 

contradicting the fact that Xoo,xio,Xoi,xn £ H n are linearly independent 
over Q. 

=>: Let {zj £ R: £ < u>i} be a Hamel basis and for every £ < tO\ define 
Lj = LINq({^: C < £})• Notice that C for £ < p < u>i, that 

L\ = for limit A < u>i, and that L Ul = R (compare this with 

Lemma 6.1.6). Thus the sets {L^+i — : £ < u>i} form a partition of 

R \ L 0 = R \ {0}. For every £ < choose a bijection : L^ +1 — — * u 
and let / = U 5 < a ; 1 /«• Then /: R \ {0} -> us. Let H n = f~ 1 (n). 

The sets {H n : n < oj} form a partition of R\ {0}. To finish the proof 
it is enough to show that every H n is a Hamel basis. 

So choose n < u and let { x {\ = (L{+i — Lf) n f~ 1 {n ) for £ < uj\. It is 
enough to prove that 


L v = LINq({x c : £ < rj}) 


(7.13) 
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for every 77 < t 0 \, since then H n = £ < c} spans R and { x £ < c} 

is linearly independent by Exercise 1. Condition (7.13) can be proved by 
induction on rj < u> 1 . So let a < u>\ be such that (7.13) holds for every 
77 < a. 

If a is a limit ordinal then 

L a = (J L v = (J LINq({x c : C < »?}) = LINq({:e c : C < a}). 

T]<iOt T)<Qt 

If a = 77 + 1 then L v = LINq({x^ : £ < 77}) and 

{x v } U LIN Q ({a; f : ( < 77 }) = {x v } U L v C L v+ 1 
so LIN Q ({a; c : C < V + 1}) C L v+1 . 

To prove the other inclusion, recall that x v £ Lri+i — Lri- Therefore 
x v = qz v + Y2i< m for some <?o, ■ ■ ■ , Qm-i € Q and b 0 , . . . , bm- 1 G 
{#£ : £ < tj}- Moreover, q ^ 0 because x v g L v = LINq({x^ : ( < rj}). So 

z v = - ( x v - qA J e LINq({x^ : C < 7? + 1}). 

^ V i<m J 

Now {z v } U L v C LINq({;e£: £ < V + 1}), and so we can conclude that 
L v+ i C LINq({;e ? : C < 77 + 1 }). □ 

EXERCISES 

1 Complete the proof of Theorem 7.3.4 by showing that if {rr^ e I: ( < c) 
is such that x j ^ LINq({:e£: £ < £}) for every £ < c, then { x % £ R: £ < c} 
is linearly independent over Q. 

2 Let CcKbe the Cantor set. Show that there is a Hamel basis contained 
in C. Since C is nowhere dense and has measure zero, it follows that Hamel 
bases can be measurable and have the Baire property. Hint: Use the fact 
that C + C = {a + b: a,b £ C} contains the unit interval [0, 1]. 

3 Let ~ be an equivalence relation on R defined by x ~ y if and only if 
x ~ y £ Q. If V is a selector from the family of all equivalence classes of 
~ then V is called a Vitali set. It is known that any Vitali set is neither 
measurable nor has the Baire property. 

Construct a Vitali set V such that 

(a) V + V = R, 

(b) V + V is a Bernstein set. 

4 Prove that the graph of any additive discontinuous function F : R — > R 
is dense in R 2 . 
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7.4 Symmetrically discontinuous functions 

This section is motivated in part by the following generalization of the con- 
tinuity of real functions. A function /: R — > R is said to be symmetrically 
continuous at the point x G R if 

. + [f(x - h) - /( x + h)] = 0, 

that is, if for every £ > 0 there exists a d > 0 such that 

(0, d) C S%, 

where S% = {h > 0: \f(x — h) — /( x + h)\ < e}. A function /: R — > R is 
symmetrically continuous if it is symmetrically continuous at every point 
ieR. 

Clearly, every continuous function is symmetrically continuous. The 
converse implication is not true, since there are symmetrically continuous 
functions that are not continuous in the usual sense. For example, the 
characteristic function X{ 0 } of a singleton set {0} is symmetrically contin- 
uous and discontinuous at 0. However, symmetrically continuous functions 
cannot behave too badly in this respect: It can be proved that the set of 
points of discontinuity of a symmetrically continuous function must be of 
first category and have measure zero. 

The study of symmetrically continuous functions is an important sub- 
ject in real analysis, motivated by Fourier analysis. In this section, how- 
ever, this notion serves merely as a motivation, since we will study here the 
functions that are not symmetrically continuous. 

It is easy to find a function that is not symmetrically continuous. For 
example, a characteristic function of any nontrivial interval is symmetri- 
cally discontinuous at the endpoints of that interval. In fact, it is also not 
difficult to construct a function / : R — > R that is nowhere symmetrically 
continuous. Such a function must have the property that for every x G M 
there exists an £ > 0 such that 

\/d > 0 3h G (0 ,d) {\f(x - h) - f(x + h)\ >e), (7.14) 

or, equivalently, such that 

(0,d) t S e x for every d > 0. (7-15) 

To get such an example define / as the characteristic function Xh of any 
dense Hamel basis H. Notice that such a basis exists by Theorem 7.3.4. To 
see that Xh is nowhere symmetrically continuous, take x G R, £ £ (0,1), 
and an arbitrary d > 0. We have to find h G (0,d) with the property 
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that \Xh(x — h) — Xh( x + h)\ > e. So let q\b\ + • • • + q n b n be a repre- 
sentation of x in basis H . Then, by the density of H , there exists a 
b £ H fl {x — d, x) \ {bi, . . . , b n }. Pick h = x — b. Then h £ (0, d) and 
Xh (x — h) = Xn{b) = 1. On the other hand, Xh(x + h) = 0asx + h = 
2 x — b = 2 qibi + ■ ■ • + 2 q n b n — bfLH. Therefore | Xji(x — h) — Xh{x + h)\ = 
1 > £. 

Considerations of how badly nowhere-symmetrically continuous func- 
tions can behave led several people 1 to to ask whether there exists a func- 
tion / satisfying (7.14) with the quantifiers Vd > 0 3h € (0 ,d) replaced by 
the reversed quantifiers 3d > 0 Vh G (0, d), that is, whether there exists a 
function / : R — > M such that for every it! there exists an e > 0 with 

3d > 0 VO < h < d (| f(x — h) — f(x + h)\ > e), 

or, equivalently, such that 

(0, d) fl S% = 0 for some d > 0. 

Replacing e and d by their minima, we can rephrase this problem by asking 
whether there exists a function / : M — > R such that 

(*) for every i£l there exists a d > 0 with 

(0 ,d)ns* = 0, (7.16) 

where Sf = {h> 0: | f(x — h) — f{x + h)\ < d}. 

A function / satisfying (★) will be called a uniformly antisymmetric func- 
tion. 

The existence of uniformly antisymmetric functions can be inferred from 
the following theorem due to Ciesielski and Larson (1993-4). 

Theorem 7.4.1 There exists a partition V = {P n : n £ N} of IS. such that 
for every the set 

S x = \J{h>0: x-h,x+h£ P n } (7.17) 


is finite. 

Before proving this theorem, we first show how to use it to construct a 
uniformly antisymmetric function. 

Corollary 7.4.2 There exists a uniformly antisymmetric function 
f: R->N. 


1 Evans and Larson in 1984; Kostyrko in 1991. 
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Proof Let V = { P n : n G N} be a partition satisfying (7.17) and for x € R 
define 

/( x) = n if and only if x £ P n . 

Then / is uniformly antisymmetric, since for every i£l the set 

Sl = {h> 0: \f(x—h)—f(x+h)\ < 1} = {h > 0: f[x-h ) = f{x+h)} = S x 

is finite, so d = min(5 x U {1}) satisfies (*), as S x = S x . □ 

Proof of Theorem 7.4.1 Let H be a Hamel basis. Then every x £ 
R has a unique representation x = <7i&i + • • • + q n b n i n the basis H 
(that is, b\ <■■■ <b n are from H and qi, . . . ,q n € Q \ {0}). Let B x = 
C H and define c x : B x — > Q by putting c x (bi) = qi for 
1 < i < n. Moreover, extend each c x to c x : H — » Q by putting 


c x (b) 


c x (b) for b £ B x , 
0 otherwise. 


Then, for every itR, 


x = ^2 c *( fe ) b = ^2 b • 
beB x beH 

We will start the argument by proving that there exists a countable set 
D = {g n : n < uj} of functions from H into Q such that 

Va: £ R 3n < to (c x C g n ). (7.18) 

To see it, let U = {(p, q) fl H : p, q £ Q} and notice that U is countable. 
Define 

D = < qiXu i £ Q h : n £ ui and {q t , Uf) £ Q x U for every i < n 

\ i<n 

Clearly, D is countable, since it is indexed by a countable set U ra <o>( < Q x ^0" 
To see that it satisfies (7.18), take i£l and find a family {Ub £U:b£ B x } 
of disjoint sets such that b £ Ub for every b £ B x . Then g = X^6es x Ca; (^) •*- u b 
belongs to D and c x (b) = g(b) for every b £ B x . So c x C g. Condition 
(7.18) has been proved. 

Now let { g n : n < w} be as in (7.18). Define /: R — > to by 
f{x) = min {n <oj: c x C g n } 

and let P n = / _1 (n) for every n < lo. We will show that the partition 
V = {P n : n £ N} of R satisfies (7.17). 
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So fix x G R and let h £ S x . We will show that 

c x +h(b ) G {0,c x (b),2c x (b)j for every 6 G H. (7-19) 

This will finish the proof, since {0, c x (b), 2 c x (b)} = {0} for b G H\B X and 
so there are at most 3l Sx l < u> numbers x + h such that h G S x . Thus S x 
is finite. 

To see (7.19), notice that (x + h) + (x — h) = 2x, so 

T. C x+h (b) b + y c x -h(b) 6 = 2 y c x (b) b. 

b£H b£H b£H 

In particular, 

c x+ h(b) + c x _ h (b) = 2c x (b) (7.20) 


for every b G H. 

Now, if x + h,x — h £ P n then c x +h C g n and c x -h C g n . Hence 

c x +h{b) = g n {b) = c x -h(b) for every 6 G B x+h n B x _ h . 

Therefore, by (7.20), c x+h (b) = c x (b) for every b G B x+h n B x _ h . But for 
6 G H\ B x+ h we have c x+ h(b) = 0, and for every 6 G B x+ h \ B x _h we have 
Cx-\-h(S) — Cx+h(b^) T c x — h(b) — 2c x ( 6) . Thus c a ,_(_/ l (6) G {0, 0^(6), 2c x (b^)} for 
every 6 G H . This finishes the proof of condition (7.20) and Theorem 7.4.1. 
□ 

An interesting open problem is whether or not there exists a uniformly 
antisymmetric function with finite or bounded range. Some partial results 
are known in this direction. Komjath and Shelah (1993-4) proved that 
there is no function /: R — > R with finite range such that all the sets S x 
from Theorem 7.4.1 are finite. It has also been proved by Ciesielski (1995- 
6) that the range of any uniformly antisymmetric function must have at 
least four elements. The next theorem, due to Ciesielski and Larson (1993- 
4), shows only that the range of a uniformly antisymmetric function must 
have at least three elements. 

Theorem 7.4.3 ///: R — > {0, 1} then f is not uniformly antisymmetric. 

Proof To obtain a contradiction assume that there exists a uniformly 
antisymmetric function /: R — > {0, 1} and for every x G R let n x G N be 
such that for every h G (0, 1 /n x ), 

\f(x - h) - f(x + h) | > l/n x . 

Fix bGN such that L = {x G M: n x = n} is uncountable. Then 

f(x — h) ^ f{x + h) for every x G L and h G (0, 1/n). (7.21) 
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Since {(£, ^±1] fl L: k £ Z} forms a countable partition of L , at least one 
of these sets must be uncountable. In particular, we can choose x,y, z £ L 
such that x < y < z < x + 1 /n. Now we would like to find points a, b, c € R 
such that x, y, and z are the midpoints of pairs (a, 6), (6, c), and ( a,c ), 
respectively, that is, such that a + b = 2x, b + c = 2y, and a + c = 2z. It is 
easy to see that the points 


a = x — y + z, b = x + y — z, c = —x + y + z 

have these properties. Moreover, for h = z — y £ (0, 1/n) we have a = x + h 
and b = x — h. Hence, by (7.21), /(a) ^ f(b). Similarly, f(b) ^ /(c) and 
/(c) ^ /(a). But this is impossible, since the points a, 6, and c are distinct 
and / attains only two values. □ 

We proved Corollary 7.4.2, via Theorem 7.4.1, by showing that for every 
the set S x = S x of exceptional points, being finite, is bounded away 
from 0. What if we allow the sets S x to be countable? Certainly, such sets 
do not have to be bounded away from 0. But we can replace the condition 
of being “bounded away from 0” by the weaker condition of being “almost 
bounded away from 0,” in the sense that S x fl (0, d) is countable for some 
d > 0. Can we prove then an analog of Theorem 7.4.1? Surprisingly, 
the answer depends on the continuum hypothesis, as is proved in the next 
theorem. 

The general form of this theorem is due to Ciesielski and Larson (1993- 
4). However, implication (i)=>(ii) was first proved by Sierpinski (1936). 
The equivalence of (i) and (ii) is also implicitly contained in a work of 
Freiling (1989-90). 

Theorem 7.4.4 The following conditions are equivalent. 

(i) The continuum hypothesis. 

(ii) There exists a partition V = {Ho, Hi} of R such that for every itl 
the set S x = Ui<2 {/i. > 0: x — h 7 x + h £ H,} is at most countable. 

(iii) There exists a function f : K. — > {0, 1} such that for every iff there 
is a d > 0 with the property that fl (0, d) | < ui. 

Proof (i)=>(ii): Let H = { b q: ( < uq} be a Hamel basis. For i£l\ {0} 
let q{x) = q n , where x = qib^ + ■ ■ ■ + q n bc, n is the unique representation of 
x in the basis H , with Ci < • • • < Cn and <??: 7^ 0- Put 


x £ Ho if and only if q{x) > 0, 

and Hi = R\Hq. We will show that the partition V = {Hq, Hi} satisfies (ii). 
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For £ < to\ let K \ = LINq({&^: £ < £}). Notice that R = Uf <aJi 
and that every K / is countable. We will show that if x £ K j then S x C K /. 
This will finish the proof. 

But it is easy to see that for h £ R \ K ^ and x £ 

h £ A 0 if and only if x + h £ A 0 , 

since q(h) = q(x + h). However, h and —h cannot belong to the same Ai 
for h yf 0, so S x C iQ. 

(ii) =>(iii): For x £ R define f(x) = i if x £ Ai. Then Sf = S x is 

countable for every and d £ (0, 1). 

(iii) =>(i): Let / be as in (iii), and for every x £ 1 let n x £ N be such 
that the set 

C x = S l J n * n (0, l/n x ) = {h£ (0, l/n x ) : /( x - h) = /( x + h)} 
is countable. 

To obtain a contradiction, assume that the continuum hypothesis fails 
and let B be a linearly independent subset of R over Q of cardinality u> 2 - 
Choose K C B of cardinality u > 2 such that for some n £ N we have n x = n 
for all x £ K. Let U be an open interval of length less than 1/n such that 
the set L = K n U has cardinality u> 2 . Then, in particular, 

f(x — h) yf f{x + h) for x £ L and h £ (0, 1/n) \ C x , (7.22) 

and 

\x — y\ < 1/n for every x, y £ L. (7.23) 

Define, by transfinite induction, a sequence (t$ £ L: £ < u> 2 ) such that 

t(_ £ L\T^ for every £ < w 2 , (7.24) 

where Tj is the smallest linear subspace of R containing {t^ : £ < £} and 
such that 


C x C Tj for every x £ T/. (7-25) 

Such T/ is obtained by applying Lemma 6.1.6(a) to the set 7, = {t,^ : £ < £} 
and the family T of operations x >— > C x , (x,y) 1 — > x + y, and x 1 — > qx for 
every q £ Q. Then Tg has cardinality < a>i, so the induction can be done 
easily. 

Now put x = to, z = t Ul and, for 0 < £ < tCi, consider the numbers 
| — x + t^|. All these numbers are different, so there is 0 < rj < uq such that 


| — x + trj\ C z . 


(7.26) 
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Put y = t rj and proceed as in Theorem 7.4.3. Define 

a = x — y + z, b = x + y — z , c = — x + y + z. (7.27) 
Then (a + b)/2 = x, (b+ c)/2 = y, and (c + a)/2 = z. We will show that 

m^m, /(c^/w, (7.2s) 

which will give us the desired contradiction, since the points a, 6, and c are 
distinct and / admits only two values. 

To prove (7.28), notice first that (7.28) follows from (7.22) as long as 

\a — x\ < 1/n, | b — y\ < 1/n, \c — z\ < 1/n (7.29) 

and 

\a-x\£C x , \b-y\£C v , \c-z\£C z . (7.30) 

But (7.29) follows easily from (7.27) and (7.23). Finally, (7.30) can be 
proved as follows. 

| a — x\ C x , since otherwise we would have z — y = a — x £ T Ul and 
z £ y + T Wl C T 0Jl , contradicting z = t Wl T Ul . 

\b — y\ (jL C y , since otherwise we would have z — x = y — b£ T Ul and 
z £ x + T Wl C T Ul , contradicting z — tun Tun ■ 

\c — z\ ^ C z , since otherwise we would have \ — x + t v \ = \ — x + y\ = 
\c — z\ £ C z , contradicting (7.26). □ 

To motivate the last theorem of this section we need to reformulate 
Theorem 7.4.1 in another language. For this we need the following defini- 
tions. We say that partition V ofRisu) sum free if for every x £ R. the 
equation a + b = x has less than u> solutions with a and b being in the same 
element of the partition, that is, when for every i£l the set 

T x = {{a, b) : a + b = x and a,b £ P for some P £ V} 

is finite. Similarly, we say that partition V ofRisu) difference free if the 
set 

D x = {(a, b) : a — b = x and a,b £ P for some P £V} 

is finite for every i£l, x ^ 0. 

In this language Theorem 7.4.1 reads as follows. 

Corollary 7.4.5 There exists a countable partition V of K. that is to sum 
free. 

Proof It is enough to notice that T 2x = {(x + h,x — h): \h\ £ S x U {0}}, 
where S x is from Theorem 7.4.1. □ 

Can we prove the same results about w-difference-free partitions of R? 
The answer is given by the next theorem, due to Ciesielski (1996). 
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Theorem 7.4.6 The continuum hypothesis is equivalent to the existence 
of a countable partition V of R that is u> difference free. 

Proof The proof of the theorem is similar to that of Theorem 7.3.9. 

<=: To obtain a contradiction assume that c > u >2 and that there exists 
a countable partition V of R that is w difference free. 

Let H be any Hamel basis. Since \H\ = c > u >2 by Lemma 7.3.11 we 
can choose disjoint sets Aq £ [Hff 2 and Bo £ [H] 2 and P £ V such that 

a + b £ P for every a £ A 0 and b £ B 0 . 

Take different bo , b\ £ Bq. Then 

(a + b 0 ) — (a + bi) = b 0 — bi 

for every a € Aq, that is, D contains all pairs (a + bo, a + bi). This 
contradicts the assumption that D x is finite for x = bo — b\ yf 0. 

=>: Represent R as the union of an increasing sequence (V a : a < x>\) 
of countable linear subspaces V a of R over Q such that V\ = U q <a f° r 
every limit ordinal A < u>\. Such a sequence exists by Lemma 6.1.6(b) 
applied to the family T of operations (x, y) i— > x + y and x i— > qx for every 
q £ Q. For convenience we will also assume that Vo = 0. 

Thus {V a+ \ \ V a : a < u>\} is a partition of R into countable sets. For 
a < u>\ let {p“ : n < w} be an enumeration of V^+i \ V a . By induction on 
a < u>i we will define one-to-one functions / : Vq+i \ V a — > u> such that the 
following inductive condition holds 2 

f(Pn) G w \ {/(p) : p £ V a & p = p" ± p? for some j < n}. 

We will show that the partition V = {/^ 1 (n) : n < u>} of R is w difference 
free. 

So choose an arbitrary x = p| yf 0 and consider the pairs (a, b) satisfying 
a — b = x with a and b being from the same element of V , that is, such 
that f(a) = f(b). It is enough to show that {a, b} n {pj : j < n} yf 0. 

Let a = p^ and b = p^. Then p^ — p 1 ^ = p|. Notice that 6 = 
max{£, a, (3} must be equal to at least two of £, a, and (3, since otherwise 
the number p with the index 8 would belong to V&. Moreover, a (3, since 
otherwise f{Pm) = fi a ) = f(b) = fiPk)i contradicting the fact that / is 
one-to-one on V a+ \ \ V a . We are left with two cases: 

If f = a> (3 then pi - pi = a - x = b = pP k e Vj. So /(pi) = /(a) = 
f(b) = f (pi — Pn) implies that m < n and a = pi £ {p| : j < n}. 

2 We define here / separately on each set from the partition {Va + 1 \ Va '• ex < ui} of M. 
Formally, we should be using a different symbol, such as /«, for such a part of /, and 
define / as the union of all f a s. However, this would obscure the clarity of this notation. 
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If £ = (3 > a then p\ + = b + x = a = € % So /(pf.) = /(&) = 

/(a) = f(p k +7*1) implies that k < n and b = p k £ (pj : j < n}. □ 

EXERCISES 

1 Prove that if / is uniformly antisymmetric then it does not have the 
Baire property. Hint: First prove the following fact due to Kuratowski: 

For every function / : R — > R. with the Baire property there 
exists a first-category set S such that /|r\s: M \ S — » R is 
continuous. 

2 (Project) Consider the following classes of subsets of M: 

• B = {B C R . B is a Bernstein set}; 

• kt = {H Cl: If is a Hamel basis}; 

• V = {Fcl: h is a Vitali set}; 

• T={Tcl:ff is a transcendental basis of R over Q}. 

Moreover, if T is any of the preceding families let 

• T* = {F C R: F + F G T}. 

Find the complete intersection/subset relations among the classes B , H, V, 
T, B*, H\ V*, and T*. 
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Chapter 8 


Martin’s axiom 


8.1 Rasiowa— Sikorski lemma 

The previous chapter was devoted to constructing objects by transfinite 
induction. A typical scheme for such constructions was a diagonalization 
argument like the following. To find a subset S of a set X concerning a 
family V = {P a : a < k} we chose S = {xj € X : £ < k} by picking each 
X£ to take care of a set P But what can we do if the cardinal k is too big 
compared to the freedom of choice of the x^s; for example, if the set X has 
cardinality less than k? 

There is no absolute answer to this question. In some cases you can 
do nothing. For example, if you try to construct a subset S of w different 
from every set from the family V{u>) = {B^ : £ < c}, then you are obviously 
condemned to failure. The inductive construction does not work, since you 
would have to take care of continuum many conditions, having the freedom 
to choose only countably many points for S. 

In some other cases you can reduce a family V to the appropriate size. 
This was done, for example, in Theorem 6.3.7 (on the existence of Bernstein 
sets) in which we constructed a nonmeasurable subset B of R ra : The natural 
family V = C of cardinality 2 C was replaced by the family Vo = { P £ : £ < c} 
of all perfect subsets of K™. The difficult part of the proof, that is, Theo- 
rem 6.3.6, was to show that the family V 0 does the job. 

Yet in other cases, such as Theorems 7.3.9, 7.4.4, and 7.4.6, the induc- 
tion could be performed in only uq steps while we had to take care of a 
family V of cardinality c. These cases resulted in us assuming that c = u>\. 
In other words, in order to rescue a diagonal argument, we had to assume 
an additional set-theoretic assumption, the continuum hypothesis. In fact, 
in Theorems 7.3.9, 7.4.4, and 7.4.6 we also proved that this assumption 
was necessary. 
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The continuum hypothesis was also assumed in the proof of Theo- 
rem 7.2.5. In this theorem, however, we did not show that the continuum 
hypothesis is a necessary assumption, though we stated that the theorem 
cannot be proved in ZFC alone. The reason for this is that Theorem 7.2.5 
can also be proved under weaker set-theoretic assumptions, which can be 
true even if the continuum hypothesis fails. 

In remainder of this text we will be mainly interested in studying differ- 
ent kinds of set-theoretic axioms that will allow us to solve the problem of 
having a “too short induction.” For this we will need some more definitions 
and notations. 

Consider a partially ordered set (P, <). A subset D C P is said to be 
dense in P provided for every pgP there exists a d £ D such that d < p. 


Examples 1. If X ^ 0 and (P, <) = (V(X), c) then D = {0} is dense in 
P. Notice that 0 is the smallest element of P. 

In fact, D = {m} is dense in (P, <) if and only if m is the smallest 
element of P. Also, if (P, <) has the smallest element m, then D C P is 
dense in (P, <) if and only if m £ D. Dense sets such as these are too easy 
to describe to be of much interest. To avoid them, we will usually study 
partially ordered sets without a smallest element. 

2. If (P, <) = (R, <) then the set D = (— oo,0) is dense in (P, <). Notice 
that the word “dense” is used here in a different sense than it is usually 
used for the linearly ordered sets (see (8.1)). However, there will be very 
little chance to confuse these two notions of density, since we will usually 
use our new definition of density for partially ordered sets that are not 
linearly ordered. 

3. Let (X,t) be any topological space. If (P, <) = (r \ {0},c) then any 
base B in X is dense in P. 

A subset F of a partially ordered set (P, <) is a filter in P if 

(FI) for every p,q £ F there is an r £ F such that r < p and r < q, and 

(F2) if q £ F and p £ P are such that q < p then p £ F. 

Note that a simple induction argument shows that condition (FI) is equiv- 
alent to the following stronger condition. 

(FI') For every finite subset Fq of F there exists an r £ F such that r < p 
for every p £ Fq. 



8.1 Rasiowa-Sikorski lemma 


131 


Examples 1. For any chain F in a partially ordered set (P, <) the family 
F* = {p £ P: 3q £ F(q < p)} 


is a filter in P. 

2. Let Q be any family of subsets of a nonempty set X that is closed 
under finite intersections and let (P, <) = (Q, c). If x £ X then the family 
F x = {Y £ Q : x £ Y} is & filter in P. A filter F in such (P, <) is called a 
principal filter if it is of this form, that is, if F = F x for some x £ X. 

3. Let Q be a nonempty family of subsets of an infinite set X and let 
u> < k < \X\. If Cy is closed under finite unions then Q fl [A] <K is a 
filter in (P, <) = (Q,d). If Q is closed under finite intersections then 
F K = {Y £ Q : \X \ Y\ < k,} is a filter in (P, <) = (Q, c). 

4. Let (P, <) = (V(X), c). Then F C P is a filter in P if (1) A fl B £ F 
provided A, B £ F: and (2) if A C B C X and A £ F then B £ F. 
(Compare this with Exercise 3 in Section 4.4.) 

Before we formulate our next example we introduce the following nota- 
tion. For nonempty sets X and Y we will use the symbol Func(X, Y) to 
denote the family of all partial functions from X into Y, that is, 

Func(A, Y) = (J { Y d : D C X} . 

Also, for an infinite cardinal number n we put 

Func K (X, Y) = (J{F d : D £ [X] <K } = {s£ Func(A, Y ) : |s| < «}. 


5. Let T C Func(X, Y) be such that for every go,gi G T 

if go U <71 £ Func(X, Y) then g 0 U g± £ T . 

(In particular, Func K (A, Y) satisfies this condition.) Then for every / £ 
Func(A, Y) the set G/ = {g £ T \ g C /} is a filter in (P, <} = (IF, D). To 
see (Fl) let g 0 ,gi £ G f . Then g 0 U g 1 C /, so h = g 0 U gi £ Func(X,E). 
Thus h £ T and, clearly, h < go and h < g\. To see (F2) notice that 
g £ Gf, h £ T , and g < h imply that h C g C /, so h £ Gf. 

In what follows we will very often use the following fact. Its main part 
can be viewed as a kind of converse of the previous Example 5. 
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Proposition 8.1.1 Let (P, <) = (F,D) for some T C Func(X, Y). If 
F C P is a filter in P then f = (J F is a function and F C {g £ T \ g C /}. 
Moreover, 

(a) if F intersects every set D x = {s £ P: i £ clom(s)} for x £ X then 
dom(/) = X ; 

(b) if F intersects every set R y = {s £ P : y £ range(s)} for y £ Y then 
range (/) = Y . 

Proof Let F C P be a filter in P and let go,gi £ F. To prove that 
/ is a function it is enough to show that go(x) = g \{x) for every x £ 
dom(c/o) H dom^). So let x £ dom(g 0 ) H dom(< 7 1 ) and let h £ F be 
such that h < go and h < g\. Then h D go and h D g\. In particular, 
x £ dom (h) and go(x) = h{x) = g\{x). 

The inclusion F C {g £ T : g C /} is obvious. 

To see (a) let x £ X. Then there exists an s £ F D D x . But s C /, so 
x £ dom(s) C clom(/). 

Condition (b) is proved similarly. □ 

In what follows we will often use filters in partially ordered sets to con- 
struct functions in a manner similar to that of Proposition 8.1.1. Usually we 
will be interested in the entire functions, and part (a) of Proposition 8.1.1 
suggests how to achieve this goal. This and the next theorem lead to the 
following definition. 

Let (P, <) be a partially ordered set and let V be a family of dense 
subsets of P. We say that a filter F in P is V-generic if 

FnD/0 for all D £ V. 

The partial orders used in the context of 25-generic filters will often be called 
forcings. Also, if (P, <) is a forcing then elements of P will sometimes be 
referred to as conditions. For conditions p,q £ P we say that p is stronger 
than q provided p < q. 

In this terminology, if all sets D x are dense in P and {D x : x £ X} C V 
then for every 25-generic filter F in P the domain of / = (J F is equal 
to X, that is, /: X — > Y. Similarly, if all sets R y are dense in P and 
{R y : y £ Y} C 25 then range(lj F) = Y for every 25-generic filter F in P. 

The next theorem, due to Rasiowa and Sikorski, shows that there are 
some interesting 25-generic filters. 

Theorem 8.1.2 (Rasiowa-Sikorski lemma) Let (P, <) be a partially or- 
dered set and p£P. If V is a countable family of dense subsets of P then 
there exists a V-generic filter F in P such that p £ F. 
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Proof Let V = { D n : n < u>}. We define a sequence (p n : n < ui) by 
induction on n < u>. We start by picking po £ Dq such that po < p. We 
continue by choosing p n+ \ £ D n+ 1 such that p n +i < Pn- 

Now let E = {p n : n < co} and put F = E* = {p £ P: 3q £ E{q < p)}. 
Then F is a filter in P intersecting every D £ V. □ 

The Rasiowa-Sikorski lemma is one of the most fundamental facts that 
will be used in the remaining sections. Its importance, however, does not 
come from its power. Its proof is too simple for this. It is the language 
of generic filters it employs that makes it so useful. In particular, it mo- 
tivates the different generalizations described in the next sections, which 
are consistent with ZFC and can be used for our problem of a “too short 
induction.” 

In most of the applications of the Rasiowa-Sikorski lemma and its gen- 
eralizations the intuition behind the proofs comes from an attempt at prov- 
ing the theorem by (transfinite) induction. More precisely, a partial order 
P used to construct an object will usually be built on the basis of an at- 
tempted inductive construction of the object. That is, conditions (elements 
of P) will be chosen as a “description of the current stage of induction.” 
The inductive steps will be related to the dense subsets of P in the sense 
that the density of a particular set D x = {p £ P: <p(p, x)} will be equivalent 
to the fact that at an arbitrary stage q of the inductive construction we 
can make the next inductive step by extending the condition q to p having 
the property ip(p, x). In particular, the family T> of dense subsets of P will 
always represent the set of all inductive conditions of which we have to take 
care, and a P-generic filter in P will be an “oracle” that “takes care of all 
our problems,” and from which we will recover the desired object. 

Evidently, if the number \T>\ of conditions we have to take care of is 
not more than the number of steps in our induction, then usually the 
(transfinite) induction will be powerful enough to construct the object, 
and the language of forcing will be redundant. In particular, this will be 
the case for all the applications of the Rasiowa-Sikorski lemma presented 
in the rest of this section. These applications, however, are presented here 
to see the use of the generic-filters technique in the simplest situations. 
Moreover, in the next sections the same theorems will be either generalized 
or used for some motivation. 

To state the next theorem let us recall that, for a linearly ordered set 
(X, <), a subset D of X is dense in X if 

for every x, y £ X with x < y there is a d £ D such that x < d < y. (8.1) 

A linearly ordered set (X, <) is said to be dense if it is dense in itself, that 
is, if A is dense in X. 
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Theorem 8.1.3 Any two countable dense linearly ordered sets, neither of 
which has a first or a last element, are order isomorphic. 

Proof Let ( X , <) and (Y, X) be two linearly ordered sets as in the theorem. 
The inductive proof of the theorem may go as follows. Enumerate X and 

Y as X = {x n : n < uj} and Y = {y. n : n < u>}. Construct, by induction on 
n < lo, a sequence ho C h\ C h% C • • • of functions such that each h n is an 
order isomorphism between X n £ [A'] <aJ and Y n £ [Y] <w , where x n £ X n 
and y n € Y n for every n < w. Then h = U n <w is an or der isomorphism 
between X and Y . The difficult part of the proof is to extend h n to h n + \ 
while maintaining the condition x n +\ £ X n+ i and y n +i G Y n+ 

To translate this proof into the language of partially ordered sets, let 
P be the set of all possible functions h n : X n — > Y n from our inductive 
construction, that is, the set of all finite partial isomorphisms from X 
to Y: 

¥ = {h £ Func w (A) Y) : h is strictly increasing}. 

Consider P to be ordered by reverse inclusion, that is, (P, <) = (P, D). For 
x £ X and y £ Y put 

D x = {h £ P: x £ dom(/i)} and R y = {h £ P: y £ range(/i)}. 

These sets are the counterparts of the conditions u x n £ X n ” and “y n £ Y n .” 
We will show that 

the sets D x and R y are dense in P, (8-2) 

which is a translation of the fact that the inductive step of our inductive 
construction can always be made. 

Before we prove (8.2), let us show how it implies the theorem. Let 

V = { D x : x £ A}U{f? y : y £ Y}. Then V is countable since the sets X 
and Y are countable. Thus, by the Rasiowa-Sikorski lemma, there exists a 
25-generic filter F in P. Then, by Proposition 8.1.1, / = (JF is a function 
from X onto Y . It remains only to show that / is strictly increasing. 

To see this, let Xo,Xi € X with Xo < x±. Then there are go,gi £ F 
such that Xo £ dom(g 0 ) and X\ £ dom(g( 1 ). Let g £ F be such that g < go 
and g < g 1 . Then j 0 UjiCjC /. So /( x 0 ) = g(x 0 ) ~< g(x i) = f(x i), 
since g is strictly increasing. Thus / is strictly increasing. We have proved 
that /: X — > Y is an order isomorphism. 

To finish the proof it is enough to show (8.2). So let x £ X. To prove 
that D x is dense, let g £ P. We have to find h £ D x such that h < g, that is, 
a function h £ P such that h D g and x £ dom (h). If x £ dom(g) then h = g 
works. So assume that x e/L dom(g). Let dom(g) = {xo,Xi, , .. , x n } with 
Xq < x\ < ■ ■ ■ < x n and let y,; = g(xf) for i < n. Then yo Hi ~< ■ ■ ■ -< y n , 
since g is strictly increasing. We will define h as gU{(x, y)}, where y £ Y is 
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chosen in such a way that h is strictly increasing. To do this, it is enough to 
pick y GY \ { yo , y 1 , . . . , y n } such that the following holds for every i < n: 

x < Xi & y -<yi. (8.3) 

If x < Xo, pick y -< yo- Such y exists, since Y does not have a smallest 
element. If a: > x n , pick y >- y n , which exists since Y does not have a 
largest element. So assume that Xi < x < ajj+i for some i < n. Then 
choose y G Y such that y,; -< y -< y-i+i, which exists since V is dense. It is 
easy to see that such y satisfies (8.3). 

We have proved that each D x is dense in P. The proof that every R y 
is dense in P is almost identical, and is left as an exercise. □ 

Evidently (Q, <) is an example of a dense linearly ordered set with nei- 
ther a first nor a last element. Thus Theorem 8.1.3 says that any countable 
linearly ordered dense set without a first or a last element is isomorphic to 
(Q, <). The order type of this class is usually denoted by the letter r). 

In fact, it can be proved that (Q, <) is universal for the class of all 
countable linearly ordered sets, in the sense that every countable linearly 
ordered set is isomorphic to (S,<) for some S CQ (see Exercise 1). 

Clearly, (R, <) is not isomorphic to (Q, <). The order type of (R, <) 
is usually denoted by A. To give a characterization of the order type A 
similar to that of Theorem 8.1.3 we will need the following definition. A 
linearly ordered set ( X , <) is said to be complete if every subset of X 
that is bounded from above has a least upper bound. That is, if the set 
B(S) = {b G X : \/x € S (x < b)} is not empty for some S C X then B(S) 
has a least element, denoted by sup A. It is a fundamental fact that (R, <) 
is complete. 

Theorem 8.1.4 Any two complete linearly ordered sets both having count- 
able dense subsets, and having neither a least nor a largest element, are 
order isomorphic. 

Proof Let ( X , <) and (Y, A) be linearly ordered sets as described in the 
theorem. Let X 0 and Y 0 be countable dense subsets of X and Y, re- 
spectively. It is easy to see that A 0 and Y 0 satisfy the assumptions of 
Theorem 8.1.3. 

Let /o : X 0 — > Y 0 be an order isomorphism between (A 0 , <) and (Lo, A). 
Define f:X — > Y by f(x) = sup{/o(cco): x 0 G X 0 & x 0 < x}. It is not 
difficult to prove that / is an order isomorphism. The details are left as an 
exercise. □ 

The next theorem will show that we can also use the Rasiowa-Sikorski 
lemma in proofs by transfinite induction. To formulate it we need the 
following definition. For f,g: u-iwwe define 

/ <* g f(n) < gin) for all but finitely many n < w. 


(8.4) 
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It is easy to see that the relation <* is transitive on u) u . 

Theorem 8.1.5 If G C u> u has cardinality < u>\ then there exists a <*- 
increasing sequence (fe: £ < u>\) such that for every g £ G there exists a 
f < oj i with g <* fc. 

Before we prove it, notice first that it immediately implies the following 
corollary. For its formulation we need the following definition. A sequence 
(/t € u) u : f < k) is called a scale in if it is <*-increasing and if for 
every g £ uA there exists a £ < k such that g <* f 

Corollary 8.1.6 If the continuum hypothesis holds then there exists a scale 
(ft e u“: £ < c). 

In the next section we will show that Corollary 8.1.6 can also be proved 
when the continuum hypothesis is false. 

The proof of Theorem 8.1.5 will be based on two lemmas. The first 
of these lemmas is not essential for the proof, but it serves as a good 
approximation for the second one. 

Lemma 8.1.7 If G C u> u has cardinality < u> then there exists an f £ lo u 
such that for every g £ G 

g(n ) < /(n) for infinitely many n < io. 

Proof An inductive construction of such / can be done as follows. Let 
G x u> = {(g n ,k n ): n < uj}. Construct a sequence /o C /i C fi C • • • 
of functions such that each /„ € Func w (a>,w) and for every n < to there 
exists an m £ dom (/„) with m > k n such that /„(m) > g n {m). Then 
/ = U n <u fn e Funcfw, lv) and any extension / of / to to has the desired 
property, since for every g £ G and k < lo there exists an m > k with 
/(m) > g(m). 

To translate this proof into the language of partially ordered sets, let P 
be the set of all possible functions /„ as before, that is, take 

P = Func w (o>, lo), 

and order it by reverse inclusion D. For g £ uA and k < u> define 

Dg ={s€P: 3m > k (?n £ dom(s) & s(?n) > g(ra))}. 

These sets are the counterparts of the inductive condition “f n (m) > g n {m) 
for some m > k n .” 

Notice that the sets Dg are dense in P, since for every i € P there exists 
an m £ w \ (k U dom(f)) and for s = t U {(m, g{m) + 1)} we have s £ Dg 
and s < t. 
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Let V = {Dgi k < u> & g G G} and let F be a ©-generic filter in 
P. Then, by Proposition 8.1.1, (J F is a function. Let /: to — > to be any 
extension of (J F . Then / has the desired property, since for every g £ G 
and k < to there are s £ FnD g and m > k such that /(to ) = s(m) > g(m). 
□ 


The lemma that we really need for the proof of Theorem 8.1.5 is the 
following. 

Lemma 8.1.8 I/GCw“ has cardinality < to then there exists an f £ lo w 
such that 

9 <* / for every g £ G. 

Proof An inductive construction of such / can be done as follows. Let 
G = {g n : n < u} and to = {x n : n < u>}. Construct a sequence /o C /i C 
fi C • • • of functions such that f n £ Func a ,(w, u>) and x n £ clom(/„) for 
every n < to. Moreover, at the inductive step n < to choose /„ such that 
f n (x) > gi{x) for all functions g t looked at so far and all numbers x that 
have not yet been considered up to this point of the induction, that is, such 
that 


f n { x) > gi(x) for all x £ dom(/„) \ dom(/„_i) and i < n. (8.5) 

Then / = U„ <a ; fn ■ oj -» u and g { <* / for every i < lo. 

One problem in finding an appropriate forcing for this construction 
is that the “current stage of induction” includes information not only on 
“functions f n constructed so far” but also on “all functions gi considered 
so far.” The first part of this information is included in functions from 
P = Func a ,(o;, u>), as in Lemma 8.1.7. However, we need the second piece 
of information as well, which is to be coded by finite subsets of G. Thus 
we define the partially ordered set P* = P x [G] <w . Moreover, the partial 
order on P*, in order to describe the extension as in (8.5), must be defined 
as follows. For ( s,A),(t,B ) £ P* we define {s,A} < ( t,B ) provided sDf, 
Ad B, and 

s(n) > g(n) for all n £ dom(s) \ dom(t) and g £ B. (8.6) 

Now the inductive conditions u x n £ dom(/ n )” and “every element g of G 
is taken care of at some stage of the induction” are coded by the following 
subsets of P*: 

D n = {(s, A) £ P* : n £ dom(s)} and E g = {(s, A) £ P* : g £ A}, 
where g £ G Cw“ and n < to. 
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The sets E g are dense in P*, since for every (s,A) £ P* we can pick 
(s, A U { g }) € Eg and clearly (s, A U {</}) < (s, A), as condition (8.6) is 
then satisfied vacuously. 

To see that a set D n is dense in P* take p = (t,A) € P*. We have to 
find q £ D n such that q < p. If n £ dom(f) then p £ D n and q = p works. 
So assume that n ^ dom(t). Define q = ( s , A) £ D n with s = tU{(n, fc)} 
and k = sup {g(n) + 1 : g £ A}. Then q < p, since clearly s D t, A D A, 
and (8.6) holds, as s(n) = k > g(n ) for all g £ A. 

Now let V = { D n : n < iv} U {E g : g £ G}. Then V is a countable 
family of dense subsets of P* so, by the Rasiowa-Sikorski lemma, we can 
find a P-generic filter F in P*. Notice that Fo = {s: (s, A) £ F} is a filter 
in P. Thus, by Proposition 8.1.1, / = (JFq is a function. Notice also that 
D n D F ^ 0 implies n £ dom(/). Therefore f maps to into itself. 

To finish the proof it is enough to show that g <* f for every g £ G. 
So let g £ G. Then there exists (t, B) £ F n E g . We will prove that 

f(n) > g(n ) for every n £ u>\ dom(t). 

This is the case since for every n £ u> \ dom(f) there is (t' , B ') G F IT D n . 
Take (s,A) £ F with (s, A) < (t,B) and (s,A) < ( t’,B '). Then n £ 
dom(t') C dom(s) so n £ dom(s) \ dom(t) and g £ B. Therefore condition 
(8.6) for (s,A) < (t,B) implies that /(n) = s(n) > g{n). □ 

Proof of Theorem 8.1.5 Let G = {g^ : £ < u>i}. Define (/^: ^ < u>\) by 
induction on ^ < to\ by choosing /j as the function / from Lemma 8.1.8 
applied to G = {/^ : ( < U {g^: ( < £}. It is easy to see that this 
sequence has the desired property. □ 

EXERCISES 

1 Prove that for every countable linearly ordered set (X, there exists 
a strictly increasing function / : X — > Q. (Such a function / is said to be 
an order embedding of A' into Q, and it establishes an order isomorphism 
between (A, -<) and (/[X],<).) 

2 Prove that (R, <) is complete as a linearly ordered set (use Dedekind’s 
definition of real numbers). 

3 Complete the details of the proof of Theorem 8.1.4 by showing that 

(a) the sets Xq and Yq are dense in themselves and have neither least nor 
largest elements; 

(b) the function / : X — > Y is an order isomorphism. 
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4 A subset I? of a linearly ordered set ( X , <) is weakly dense if 

for every x, y £ X with x < y there is a d £ D such that x < d < y. 

Prove that for every linearly ordered set ( X , <) containing a countable 
weakly dense subset there exists a strictly increasing function / : X — > R. 

5 Prove that the relation < defined on the set P* as in Lemma 8.1.8 is 
indeed a partial-order relation. 

6 For f,g: u> — » to we define 

/ <* g <t=> fin) < g[n) for all but finitely many n < w. 

Show that <* is a preorder relation on wF (see Exercise 4 from Section 2.4). 
The equivalence relation generated by <* is usually denoted by =*. Show 
also that the partial-order relation ;<* induced by <* on the family of all 
equivalence classes of =* is a linear-order relation. 

8.2 Martin’s axiom 

In this section we would like to introduce an axiom that says that in a large 
number of situations the statement “for countably many” can be replaced 
by “for less than continuum many,” even when the continuum hypothesis 
fails. 

In particular, we would like to be able to make such a replacement in 
the Rasiowa-Sikorski lemma, which leads us to the following statement. 

(*) Let (P, <) be a partially ordered set. If V is a family of dense subsets 
of P such that \T>\ < c, then there exists a 2?-generic filter F in P. 

Clearly (*) is implied by the continuum hypothesis. However, it is false 
under the negation of the continuum hypothesis. To see it, consider the 
partially ordered set (P, <) = (Func„,(w, oq), D) and, for £ < uq, the sets 
Uf = {s£P:(e range(s)}. Each set R $ is dense, since for every s € P 
we have t = sU{(n,()) £ R % and t < s for any n £ to \ dom(s). But 
V = { R j: £ < LJt} has cardinality < to\ < c. So (*) implies the existence 
of a P-generic filter F in P, and, by Proposition 8.1.1, (J F is a function 
from a subset of u onto uq. This is clearly impossible. 

Thus, in order to find a (*)-like axiom that is consistent with the nega- 
tion of the continuum hypothesis we will have to restrict the class of par- 
tially ordered sets allowed in its statement. In particular, we will have to 
exclude forcings such as (Func w (u;, uq), D). To define such a class we need 
some new definitions. 
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Let (P, <) be a partially ordered set. 

• x,y & P are comparable if either x < y or y < x. Thus a chain in P is a 
subset of P of pairwise-comparable elements. 

• x,y £ P are compatible (in P) if there exists a z £ P such that z < x 
and z < y. In particular, condition (FI) from the definition of a filter 
says that any two elements of a filter F are compatible in F. 

• x,y £ P are incompatible if they are not compatible. 

• A subset A of P is an antichain (in P) if every two distinct elements 
of A are incompatible. An antichain is maximal if it is not a proper 
subset of any other antichain. An elementary application of the Hausdorff 
maximal principle shows that every antichain in P is contained in some 
maximal antichain. 

To illustrate these notions consider (P, <) = (Func w (X, Y), D). Then 
s,t £ P are compatible if and only if s U t £ Func^fX, Y). Therefore 
elements s,t £ P are incompatible if there exists an x £ dom(s) (~l clom(f) 
such that s(x) ^ t(x). For any nonempty D £ [X] <w the set A = Y D is a 
maximal antichain. On the other hand, if C C P is a family of functions 
with pairwise-disjoint domains, then any two elements of C are compatible. 

The forcings that will be used in our (*)-like axiom are defined in terms 
of antichains in the following way. 

• A partially ordered set (P, <) is ccc (or satisfies the countable chain 
condition) if every antichain of P is at most countable. 1 

Clearly, every countable partially ordered set is ccc. In particular, the 
forcing (P, <) = (Func w (w, w), d) from Lemma 8.1.7 is ccc. Notice also 
that the forcing 

P* = Px [G] < “ from Lemma 8.1.8 is ccc (8-7) 

for an arbitrary C C including G = ui u . To see this, take an un- 
countable subset A = {pc £ P*: £ < w 1 } of P* with p % = (s$, A^) and 
notice that there must be a ( < £ < u>\ such that sy = s^. But then, for 
p = U A^) £ P* we have p < Pq and p < p £, condition (8.6) being 

satisfied vacuously. So A is not an antichain. 

On the other hand, the forcing (P, <) = (Func w (o;, wi), D) is not ccc, 
since {{(0,£)} £ P: £ < u>i} is an uncountable antichain in P. Thus, 

1 The name “countable chain condition” is certainly misleading. A more appropriate 
name would be “countable antichain condition” or “cac.” However, the tradition of this 
name is very strong and outweighs reason. This tradition can be explained by the fact 
that every partially ordered set can be canonically embedded into a complete Boolean 
algebra, and for such algebras the maximal sizes of chains and antichains are equal. 
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restricting (*) to the ccc forcings removes the immediate threat of the pre- 
viously described contradiction with ->CH, the negation of the continuum 
hypothesis. In fact such a restriction also removes all possibility of any con- 
tradiction with -^CH. More precisely, consider the following axiom, known 
as Martin’s axiom and usually abbreviated by MA. 

Martin’s axiom Let (P, <) be a ccc partially ordered set. If I? is a family 
of dense subsets of P such that \V\ < c, then there exists a 25-generic 
filter F in P. 

Clearly CH implies MA. But MA is also consistent with ZFC and ->CH, 
as stated by the next theorem. 

Theorem 8.2.1 Martin’s axiom plus the negation of the continuum hy- 
pothesis MA-h-iCH is consistent with ZFC set theory. 

The proof of Theorem 8.2.1 will be postponed until Section 9.5 where 
we will prove that MA is consistent with c = W 2 - However, the same proof 
can be used to prove the consistency of MA with c = k for most regular 
cardinals n. 

In the remainder of this section we will see several consequences of MA. 
First we will see the following generalization of Corollary 8.1.6. 

Theorem 8.2.2 If MA holds then there exists a scale (/f € uA : £ < c). 

Proof The argument is essentially identical to that for Theorem 8.1.5. 
First notice that the following generalization of Lemma 8.1.8 is implied by 
Martin’s axiom. 

(I) If G C oA has cardinality < c then there exists an / G lo u such that 
g <* f for every g G G. 

To see why, consider the forcing P* = Func a ,(cc, u>) x [G] <w defined as 
in Lemma 8.1.8, that is, ordered by 

(s. A) < (t, B) <G> sDt&ADB 

& s(n ) > g(n) for all n G dom(s) \ dom(t) and g € B, 

and its dense subsets 

D n = {(s, A) G P* : n € dom(s)} and E g = {(s, A) € P* : g G A} 

for g G and n < u. Then V = {D n : n < ui} U { E g : g G G} has 
cardinality |G| + u> < c. Since by (8.7) P* is ccc, MA implies that there 
exists a 15-generic filter F in P*. Now we argue as in Lemma 8.1.8. We 
define Fq = {s: (s,A) G F} and notice that / = (J F 0 is a function. Then 
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D n PF yf 0 implies that n £ dom(/), that is, that / maps u into oj. Finally, 
we notice that E g D F ^ 0 implies g <* f. 

To prove the theorem from (I) enumerate as { g j : £ < c} and define 
the scale (/^ : £ < c) by induction on £ < c by choosing as a function / 
from (I) applied to G = {/^ : £ < £} U {g^ : £ < £}. □ 

Another application of Martin’s axiom is stated in the next theorem. 

Theorem 8.2.3 Assume MA. If X £ [R] <c then every subset Y of X is a 
Gg subset of X, that is, there exists a Gg set G CM such that G Pi X = Y . 

Proof Let X £ [R] <c and fix Y C X. We will show that Y is Gg in X. 

Let B = {B n : n < u>} be a countable base for R. First notice that it is 
enough to find a set A C ui such that for every x £ X 

x £ Y -o- x £ B n for infinitely many n from A. (8-8) 

To see why, define for every k < oj an open set G& = (J {B n : n £ A & n > k} 
and put G = f) fc<u Gk- Then G is a Gg set and, by (8.8), for every x £ X 
we have 

x £ Y <£> x £ Gfc for all k < to. 

Thus G P X = Y. 

For countable X we could prove (8.8) in the following way. Let X\Y = 
{z n : n < w} and Y x u> = {( y n , k n ) : n < u>}. Then construct A as a union 
of an increasing sequence {A n : n < to) of finite subsets of u> such that for 
every n < oj there is an m € A n such that m > k n and y n £ B rn , and such 
that Zi B m for all new ms from A. n and all points 2 , considered so far, 
that is, such that 

Zi B m for all m £ A n \ A n _i and i < n. (8.9) 

We will transform this idea into a forcing similarly as in Lemma 8.1.8. 
We define the partially ordered set (P, <) by putting P = [w] <aJ x [A\F] <W 
and define < to take care of (8.9). That is, for (A l5 Gi), (A 0 ,G 0 ) £ P we 
define (Ai,Gi) < (A 0 ,Go) provided A\ D A 0 , Gi D Go, and 

c fL B m for all m £ A\ \ A 0 and c £ Go- (8.10) 

Now inductive conditions are coded by the following subsets of P: 

Dy = {(A, G) £ P: 3m £ A (m > fc & y £ B m )} 

and 

E z = {(A,C)£ P: z £ G}, 
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where y £ Y, k < w, and z £ X \ Y. We will use Martin’s axiom to find a 
25-generic filter for 

V = {D k y : y £Y & k <uj}U{E z : z £ X\Y}. 

To use Martin’s axiom, we have to check whether its assumptions are 
satisfied. 

Clearly |25| < |X| + w < c. 

To see that P is ccc consider an uncountable subset {(A) C{) : £ < uq} 
of P. Since [w] <w is countable, there are A £ [ijj] < u1 and f < £ < u>i such 
that A = A = A. Then (A,Cc) = ( A,C and (A>C{) = ( A,C $) are 
compatible, since (A, Cq U C() £ P extends them both, as condition (8.10) 
is satisfied vacuously. 

To see that each set E z is dense in P take {A, C) £ P and notice that 
(. A , C U {^}) € E z extends (A, C). 

Finally, to see that a set D y is dense in P, take ( A , C) £ P. Notice that 
there exist infinitely many basic open sets B rn such that 

y £ B m and C D B. m = 0. (8-11) 

Take m > k satisfying (8.11), and notice that {A U {m}, C) £ D k y extends 
(AC). 

Now apply Martin’s axiom to find a 25-generic filter F in P, and define 
A = |J{^ : (A C) £ F}. We will show that A satisfies (8.8). So let x £ X. 

If x £ Y then for every k < uj there exists (A,C) £ FT 1 D k . In 
particular, there exists an m £ A C A with m > k such that x £ B m . So 
x £ B m for infinitely many rn from A. 

If x £ X\Y then there exists (An Co) € FflE x . In particular, x £ Co- 
lt is enough to prove that x ^ B m for every m £ A\A 0 . So take m £ A\Ao. 
By the definition of A there exists (A, C) £ F such that m £ A. But, by 
the definition of a filter, there exists (A, Ci) £ F extending (A,C) and 
(A, C 0 ). Now (A, Ci) < (A) C 0 ), m £ A c A, m $ A- and x £ C 0 . 
Hence, by (8.10), x ^ B m . □ 

Corollary 8.2.4 If MA holds then 2“ = 2 K for every infinite cardinal 

k < c. 

Proof Let « < c be an infinite cardinal number. Then clearly 2“ < 2 K . 
To see the other inequality take X £ [R] K . Then, by Theorem 8.2.3, 

2 K = \V{X)\ = \{B (~l X : B £ Bor} \ < \Bor\ = 2“. □ 
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Corollary 8.2.5 If MA holds then c is a regular cardinal. 

Proof Notice that by Theorem 5.3.8 

2 cf ( c ) = (-2 w ) cf W _ £ cf(c) > c 

Hence, by Corollary 8.2.4, cf(c) > c. □ 

The next theorem tells us that MA implies the continuum additivity 
of category, which has been mentioned in the remark after Theorem 7.2.5. 
In particular, it implies that Theorem 7.2.5 can be proved when CH is 
replaced with MA. 

Theorem 8.2.6 If MA holds then a union of less than continuum many 
meager subsets of 1" is meager in R” , that is, 

\^F € M for every T £ [A4] <c . 

Proof The idea of the proof is very similar to that of Theorem 8.2.3. 

Let T £ [A4] <c . Since every F from T is a countable union of nowhere- 
dense sets, \JF is a union of \F\ +to < c nowhere-dense sets. Thus we may 
assume that every set in T is nowhere dense. 

Let B = {Bj ^ 0: j < w} be a countable base for R n . Notice that it is 
enough to find a set A C ui such that 

{m £ A: B m is finite for every F £ T (8.12) 

and 

{m £ A: B. m C Bj} is infinite for every j < ui. (8.13) 

To see why, define Uk = (J {B m : m £ A & m > k} for every k < w. 

Notice that by (8.13) the sets Uk are dense and open. Therefore the sets 
R" \ Uk are nowhere dense. But condition (8.12) implies that (J.F C 
U fc<w (R" \ Uk) £ M, since for every F £ F there exists a k < u such that 

Uk fl F = {B m fl F: to £ A & to > k} = 0. 

To prove the existence of such a set A consider the partial order IP = 
[w] <a ' x [l p ] <w , where we put (A-y,C\) < (A 0 ,C 0 ) provided Ai D A 0 , 
C\ D Co, and 

B m fl F = 0 for all m £ A\ \ A 0 and F £ Co- (8-14) 

We will define A as the union of all A such that (A, C) belongs to an 
appropriate generic filter in P. In particular, in the condition (A,C) £ P 
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the set A approximates A. The sets F from C represent “elements of T 
looked at so far” in our “induction” and (8.14) guarantees that the basic 
open sets indexed by “new elements of A” will not intersect these “old” 
Fs. This will take care of (8.12). 

The forcing P is ccc since [cc] < “ is countable and any conditions (A, Co) 
and (A, Ci) are compatible, having {A, Cq U C\) as a common extension. 

Conditions (8.13) and (8.12) are related to the following dense subsets 
of P: 

Dj = {(H, C) £ P: 3m £ A (to > k & B m c Bj)} 

and 

E F = {(A,C) eP: F £C}, 

where k,j < to and F £ T . The sets E F are dense, since (A,CU{F}) £ E F 
extends (A,C) for every (A,C) £ P. To see that the sets D k are dense take 
(H, C) £ P. Since (J C is nowhere dense, there exist infinitely many basic 
open sets B m such that 

B m cBj\\JC. (8.15) 

Take m > k satisfying (8.15) and notice that (H U {m},C) £ D k extends 

(AC) • _ _ _ „ 

Hence, by Martin’s axiom, there exists a P-generic filter F in P, where 
V = {D k : k,j <uj}U{E f : F£F}. 

Define A = (J{^4 ; ( A,C ) £ i* 1 }. It is enough to show that A satisfies 
properties (8.13) and (8.12). 

To see (8.13) take j < u>. It is enough to prove that for every k < ui 
there exists an m £ A with m > k and B m C By So fix k < u> and pick 
(A, C) £ Fn Dj. Then, by the definition of D k , there exists an m £ A C A 
with m > k such that B m C Bj. 

To see (8.12) take F £ T. Then there exists (H 0 ,C 0 ) £ F H E F . In 
particular, F £ Co- It is enough to prove that B m n F = 0 for every 
to £ A \ Aq. Take m £ A \ Ho- By the definition of A there exists 
(A,C) £ F such that to £ A. So there exists (A\,C\) £ F extending (A,C) 
and (H 0 , Co)- Now (Hi,Ci) < (Ho, Co), to £ Ac Hi, to ^ H 0 , and F £ Co- 
Hence, by (8.14), B m n F = 0. □ 

An analog of Theorem 8.2.6 for the ideal A f of measure-zero subsets of 
R™ is also true. 

Theorem 8.2.7 If MA holds then a union of less than continuum many 
nidi subsets o/l" is null in R", that is, 

[JiF £ N for every T £ [A/"] <c . 
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Proof To make the argument simpler, we will prove the theorem only for 
R. The proof of the general case is essentially the same. 

Let B 0 be the family of all open intervals in R and let 1(1) stand for 
the length of I G Bo- Recall that S G J\f if for every e > 0 there exists a 
sequence (I k G B 0 '- k < to) such that S C Ufc< w h and Y) k<w Kh) < e - 
Now, if B = {/„: n < uj} is the family of all intervals with rational 
endpoints, then the family Bo can be replaced by B. To see it, take S G J\f 
and let e > 0. Then there exists a sequence (J k G Bq- k < w) such 
that S C Ufc<a, J k and J2 k < u ^(4) < e/2. But for every k < ui there 
exists an l k G B o such that J k C l k and l(I k ) < l(J k ) + s/2 k+2 . So 
5 C U fe<a , Jk c \J k<u 4 and E fc<w Kh) < E fc «J(4) + e/2 fc+2 ] < e. 

Let T G [A/] <c and hx e > 0. We will find anicw such that 

U^u Ii% and ^ ^ /(-Z^) y £ • (8.16) 

Define P={dCw: E„ g a l(I„) < s} and order it by reverse inclusion: 
A\ < A 0 <G> Ai D A 0 . To see that P is ccc let A C P be uncountable. We 
have to find different A, A' G A that are compatible, that is, such that 
EngAUA' Kh) < £ - 

So for every A G A let niA <w be such that E« g a Kh) + 1/mi < £■ 
Since A is uncountable, there exists an uncountable subset A' of A and an 
m < uj such that in a = m for every A G A', that is, 

V l(I n ) + — < £ for every A G A'. (8-17) 

m 

neA 

Next, for every 4 g / choose kA < uj such that Eng A\k A Kh) < 1 /m. 
Since A' is uncountable, there exists an uncountable subset A" of A' and 
a k < to such that Au = k for every A G A", that is, 

y l (In) < — for every A G A" . (8.18) 

m 

n£A\k 

Now we can find two different A, A! G A" such that A fl k = A' Cl k. For 
such A and A! we have, in particular, A U A' — A U (A' \ k). So, by (8.18) 
and (8.17), 

y Kh) < y Kh) + y Kh ) < y w +^< e - 

n£AuA f n^A n£A'\k n£A 

Thus A and A' are compatible and P is ccc. 

Now for every F G T let Dp = {A G P: F C Ung Ah}- Notice that 
every Dp is dense in P. To see this, let A G P and put 8 = e— E n g a Kh) > 
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0. Since F £ AT we can find a B C u> such that F C ( J neB I n and 
J2neB K^n) < b. But then A U B € Dp since 

5 ] Kin) < Y Kin) + Y Kin) < Y l ^n) + <5 = £ 

neAUB neA nGB nGA 

and F C UneAus In- Since A U B < A, the set Dp is dense. 

Let V = {Dp: F € F}. Since P is ccc and |2?| < \F\ < c, by Martin’s 
axiom there exists a 27-generic filter F in P. Let A = (J F. We will show 
that A satisfies (8.16). 

For every F £ T there exists an A € F n Dp. So F C {J neA I n C 
UneA In since Ac A. Then \JF C \J n eA In- 

To see J2 n eA Kin) < £ it is enough to show that J2 n eAnk Kin) < e for 
every k < u>. So fix k < to. Now for every n £ A n k there exists an A n £ F 
such that n £ A n . Using condition (FI') of the definition of a filter we can 
find an A £ F such that A < A n for every n £ An k. But then A fl k C A 
and 

E KIn)<YK I n)<e. 

neAnfc neA 

This finishes the proof. □ 

All previous applications of Martin’s axiom could be deduced as well 
from the continuum hypothesis, but in the remaining part of this section 
we will show that this will not always be the case, by discussing the con- 
sequences of MA+- 1 CH, which do not follow from CH. Similar results can 
also be found in the next section. 

In the next theorem we will use the following terminology. A subset A 
of a partially ordered set (P, <) is compatible if for every finite subset Aq 
of A there exists ap£P such that p < q for all q £ A 0 (compare this with 
condition (Fl') from the definition of a filter). 

Theorem 8.2.8 Assume MA+^CH and let (P, <) be a ccc partially or- 
dered set. If A C P is uncountable, then there exists an uncountable com- 
patible subset A of A. 

Proof Let A C P be uncountable. Without loss of generality we may 
assume that |A| = uq. 

The inductive approach to the proof is to construct a strictly increasing 
sequence (A^ : f < tof) of compatible subsets of A such that Uf<u; is 
an uncountable compatible subset of A. Although this idea is basically 
correct, it has a fundamental flaw. If you start with Aq £ [A]-^ such that 
Aq is compatible with at most countably many elements of A, then this 
construction must fail. Fortunately, there are only countably many sets 
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Aq £ [A] - u that may cause such a problem, and after removing them from 
A we will be able to follow the idea just described. 

So for p £ P let A p = {q £ A: q is compatible with p}. We will show 
that the set 

A' = {q £ A: 3p < q {\A p \ < u)} is at most countable. (8.19) 

Indeed, consider the family T = {C C B: C is an antichain in P}, 
where B = {p £ P: \A p \ < u>}. It is easy to see that T satisfies the 
assumptions of the Hausdorff maximal principle. Thus we can choose a 
maximal element Co of T . Now Co is clearly an antichain in P. So Co is 
at most countable, since P is ccc. It is enough to prove that 

A' c { J A r , 

reC 0 

since the set {J reC() countable, being a countable union of countable 
sets. To see the inclusion, take q £ A'. Then there exists a p < q such 
that p £ B. By the maximality of Co there exists an r £ Co such that r 
is compatible with p, that is, s < r and s < p < q for some s £ P. In 
particular, r and q are compatible, so q £ A r . Condition (8.19) has been 
proved. 

Now the set A* = A \ A’ is uncountable. Moreover, if p < q for some 
q £ A* then A p must be uncountable. In particular, 

A* = {q £ A* : q is compatible with p} is uncountable (8.20) 

for every p £ P such that p < q for some q £ A*. 

Coming back to the idea of an inductive proof, we can see that we 
could have a hard time extending an infinite set A ^ to a compatible set 
properly containing A^. This problem will be solved with the help 
of MA. 

For this, consider a partially ordered set 

P* = {F £ [A*] <ul : F is compatible in P} 

ordered by reverse inclusion D. Notice that the forcing P* is ccc. To see 
why, let {.Fj £ P* : £ < We will find ( < £ < to\ such that F^ and F, 1 
are compatible, that is, that F^UF^ £ P*. But for every £ < u>\ there exists 
a p^ £ P with p^ < p for every p £ F^. Moreover, the set { p £ < o>i} 
cannot be an antichain, since P is ccc. Thus there are compatible p £ and 
P£ for some £ < £ < Now, if q £ P is such that q < p^ and q < pj then 
q < p for every p £ F^ U F%. Therefore Fq U F^ £ P* and P* is ccc. 

Let : ^ < wi) be a one-to-one enumeration of A*. For a < u>\ define 

D a = {F£P*:3(>a( qi £F)} 
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and notice that the sets D a are dense in P*. This is so since for every 
F £ P* there exists a p £ P such that p < q for every q £ F, and, by 
(8.20), p is compatible with uncountably many q £. In particular, there 
exists a £ > a such that qc is compatible with p, and it is easy to see that 
FU {q^} £ D a extends F. 

Let V = { D a : a < u>\}. Then \V\ < lo\ < c so, by MA, there exists 
a "D-generic filter F in P*. We will show that A = (J F is an uncountable 
compatible subset of A. 

Clearly, A C A. To see that A is uncountable, notice that for every 
a < u> i there exists a £ > a such that q ^ £ A, since F fl D a 0. To finish 
the proof, it is enough to show that A is compatible. 

So let F = {r 0 , . . . , r„} C A. For every i < n there exists an Fi £ F 
such that r.j £ F t . By (Fl') we can find an E £ F such that E < Fi for all 
i < n. In particular, E D {?’o, ■ ■ ■ , r n } = F, that is, F is compatible. □ 

By definition, a forcing P is ccc if every uncountable set {p a : a £ A } 
contains two different compatible elements. Equivalently, P is ccc if for 
every sequence (p a : a £ A) (not necessarily one-to-one) with uncountable 
index set A there are different indices a, j3 £ A such that p a and pp are 
compatible. In this language Theorem 8.2.8 can be restated as follows. 

Corollary 8.2.9 Assume MA+^CH and let (P, <) be a ccc partially or- 
dered set. If ijp a : a £ A) is an uncountable sequence of elements of P 
then there exists an uncountable subset A of A such that {p a : a £ A} is 
compatible in P. 

To state the next corollary, we need the following important definition. 
Let (Po, <o) and (Pi, <i) be partially ordered sets. Their product (P, <) is 
defined by P = Po x Pi and 

(po,Pi) < (qo,qi) Po <o <7o & Pi <i 91- 

Corollary 8.2.10 If MA-h-iCH holds then the product of two ccc forcings 
is ccc. 

Proof Let (P, <) be a product of ccc forcings (Po,<o) and (Pi,<i) and 
let ( ip a 5 9 a ) • a € A) be an uncountable sequence of elements of P. By 
Corollary 8.2.9 used for the forcing P 0 and a sequence (p a : a £ A), we 
can find an uncountable subset A of A such that the set {p a : a £ A} is 
compatible in Po- Then there are different a, (3 £ A such that q a and qp 
are compatible in Pi, since Pi is ccc. It is easy to see that ( p a ,q a ) and 
{pp, qp) are compatible in P. □ 

The last theorem of this section shows that Corollary 8.2.10 is false 
under CH. In its proof we will use the following lemma, which is the main 
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combinatorial tool needed to prove that different kinds of forcing (built 
with finite sets) are ccc. 

Lemma 8.2.11 (A-system lemma) If A is an uncountable family of finite 
sets then there exists an uncountable subfamily Ao of A and a finite set A 
such that Ink = d for every distinct X,Y £ Ao . 

Proof Since |.4| > u> we may assume that there is an n < u> such that 
each element of A has exactly n elements. The proof is by induction on n. 

By our assumption we must have n > 0. (Otherwise A C {0} has 
cardinality < 1 < ui.) If n = 1 then elements of A must be pairwise 
disjoint and the theorem holds with A = 0 and Ao = A. So assume that 
n > 1 and that the theorem holds for (n — l)-element sets. Consider two 
cases. 

Case 1: There exists an Aq £ A such that Ao intersects uncountably 
many A £ A. Then there exists an a £ Ao such that a belongs to uncount- 
ably many A £ A. In particular, 

B = {A \ {a} : A £ A & a £ A} 

is an uncountable family of sets of size n— 1, and by the inductive hypothe- 
sis, we can find an uncountable Bo C B and a finite set B such that Cfl-D = 
B for every distinct C,D £ Bo- But then do = {C U {a} : C £ Bo} C A is 
uncountable and X fl Y = B U {a} for every distinct X,Y £ Ao- 
Case 2: For every A £ A the set 

S A = {B £A: AnB^H} 

is at most countable. Construct, by transfinite induction, a one-to-one 
sequence (A ? : £ < uf) of pairwise-disjoint subsets of A. This can be done 
since for every £ < u>\ the set 

{B £ A: B fl ^ 0 for some ( < = [J Sa ( 

C<C 

is at most countable. Then Aq = f and A = 0 satisfy the 

desired requirements. □ 

In the next theorem, due to Galvin (1980), for sets A and B we will use 
the notation 

A (g) B = {{a, b} : a £ A & b £ B}. 

Theorem 8.2.12 If CH holds then there are two ccc forcings such that 
their product is not ccc. 
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Proof The forcings will be constructed as follows. We will construct a 
coloring function /: [wi] 2 — > 2. Then, for i < 2, we put Ki = f~ 1 (i ) and 
define 

P,: = {PG [wi] <u; : [FfcK,} 
ordered by reverse inclusion D. 

First we notice that the forcing Po x Pi is not ccc, since the family 
{({£D {CD G Po x Pi : £ < wi} forms an antichain. This is the case because 

compatibility of ({CD {CD ancl ({CD {CD im P lies ({C, CMC, CD e Po * Pi, 

and this is impossible since Po D Pi = [uq]- 1 is disjoint from [uq] 2 . 

Thus it is enough to construct an / such that the forcings P» are ccc. 
We will first translate the ccc property of Pj into a condition that will be 
appropriate for the inductive construction of /. 

So fix i < 2 and let (Fj € Pi : £ < u>\). To prove that Pj is ccc we will 
have to find £ < £ < uq such that and F, \ are compatible, that is, that 

[F ( U F ? ] 2 C Ki. 

First notice that by the A-system lemma we can assume, choosing a sub- 
sequence, if necessary, that for some F G [uq] <a; 

D = F for all £ < £ < uq . 

But [F c U F ? ] 2 = [F c ] 2 U [Fj,] 2 U {F c \ F) 0 (F ? \ F) and [F c ] 2 U [F^] 2 C K,, 
Thus we must find £ < £ < uq such that 

\ n 0 (/•{ \ /•') c K;. 

Replacing Fj with Fj \ F, we notice that it is enough to prove that for 
every sequence (Fj G Pj : £ < oq) of pairwise-disjoint nonempty sets there 
are £ < £ < oq such that 

F ( C Ki. 

Moreover, for every a < uq such that \J n<u F n C a there exists a £ < oq 
with F = F% C oq \ a. Thus we can reduce our task by showing that for 
every sequence (F n G Pj : n < u>) of pairwise-disjoint nonempty sets there 
exists an a < uq with [j n<UJ F n C a such that for every F G [uq \ a] <UJ 

3 n <lo (F n (g iFc K t ), 


that is, that 


3 n < w V/3 G F„ V7 G F (/({/?, 7}) = *)• (8-21) 

Now, by induction on £ < uq, we will construct an increasing sequence of 
partial functions /: [£] 2 — > 2 such that the entire function /: [uq] 2 — > 2 will 
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satisfy condition (8. 21). 2 For this, let {(F^) n<w : £ < uq) be an enumeration 
of all sequences (F n ) n< u of pairwise-disjoint nonempty finite subsets of uq 
such that each sequence appears in the list uq times. Such an enumeration 
can be chosen by CH, since the family of all such sequences has cardinality 
< |([cui] <a, ) w | = c. The construction will be done while maintaining the 
following inductive condition for every £ < uq: 

(/{) For every i < 2, a < £, and F € [£ \ a ] <u , if Un<w c a then the 
set 

£(i, a, F) = {F % : V/3 G (/({/?, 7 }) = 0} (8-22) 

is infinite. 

Notice that this will finish the proof, since then f will satisfy (8.21) 
for every i < 2 and every sequence ( F n ) n<w of pairwise-disjoint nonempty 
finite subsets of uq . To see this, choose a < uq such that F n C a 

and (F%) n<w = (F n ) n<u , and for every F G [uq \ a\ <u find a £ < uq 
such that F G [£ \ a] <w . Then, by (7^), there exists an n < ui such that 
F£ G £(i,a,F). This n satisfies (8.21). 

To make an inductive step, let rj < uq be such that the construction is 
already made for all £ < r\. 

If r\ is a limit ordinal, then /: [?y] 2 — > 2 is already constructed and it is 
easy to see that / satisfies (I v ). So assume that r/ = £ + 1. We have to 
extend / to {{/?, £}: (3 < £} while maintaining (I v ). So let 

F=\{£{i 1 a,F),i):i< 2, a < ?, fG^af, |J Ca 

n<cj 

Clearly T 7 is countable, being indexed by a countable set. Note also that 
if (£(i,a, F),i) G F then £(i,a,F) is infinite. For a < £ this follows 
directly from the inductive assumption (/{). But if a = £ then i 7 = 0 and 
£(*, a, i 7 ) = £(*, a, 0) = {F“ : n < u>} is infinite as well. 

Let ((£ m , i m ) : m < u>) be an enumeration of T with each pair appearing 
infinitely many times. Since each £ m is an infinite family of pairwise- 
disjoint finite sets, we can construct by induction on m < u> a sequence 
{E m : n < io) of pairwise-disjoint sets such that E m G £ m for every m < 
u>. Define /({/?, ^}) = i m for every (3 G E m and m < uq and extend it 
arbitrarily to [ry] 2 . It is enough to show that f satisfies condition (I v )- 

So let i < 2, a < ri = £+l> and F G [?y\a] <u; be such that (J„< u C a. 
If F = 0 then £(i,a,F) = {F“: n < cu} is infinite, and (8.22) holds. So 


2 Formally we are defining an increasing sequence of functions /£ : [£] 2 — ► 2, aiming for 
/ to be their union. But such an additional index would only obscure a clear idea. 
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assume that F 0. Now, if £ ^ F then (8.22) holds for F by (J|), since 
0/f'C(\a implies a < £. So assume that £ € F and let F = F \ {£}. 
Then (£(i,a,F),i) £ T and (£(i,a, F),i) = (£ m ,i m } for infinitely many 
m < ui. In particular, for every such to we have 

f({P,Z}) = im = i (8.23) 

for every (3 £ E m . We claim that every such E m belongs to £(i, a, F). This 
will finish the proof, since all sets E m are different. But for every f3 £ E m 

/({A 7}) = i 

holds for 7 £ F = F \ {£}, since E m £ £ m = £(i, a, F), and for 7 = £ by 
(8.23). This finishes the proof. □ 

EXERCISES 

1 Let X and Y be nonempty sets. Show that the forcing (Func w (X, Y),d) 
is ccc if and only if |Y| < w. Hint: Use the A-system lemma. 

2 A subset Z of R has strong measure zero if for every sequence (e n : n < w) 

of positive numbers there exists a sequence ( J n : n < u>) of open intervals 
such that each J n has length less than e n and Z C U„,<w ' Assuming MA 

show that every Z £ [K] <c has strong measure zero. Hint: Let B be the 
family of all open intervals with rational endpoints. For every (e n : n < w) 
of positive numbers use forcing 

P = { J £ B n : n < u & for every k < n the length of .7 ( k) is less than £/ c } 
ordered by reverse inclusion D. 

3 (Due to Solovay) Infinite sets A and B are said to be almost disjoint 
provided A fl B is finite. Show that MA implies the following fact: If A 
is a family of almost-disjoint subsets of u), |M| < c, and C C A, then there 
exists a set S' C w such that S fl C is finite for every C £ C and S D A is 
infinite for A £ A \ C. Hint: Use a forcing P = [w] <w x [C] <u> ordered by 
( s , £) < ( s £') if and only if s D s', £ D £', and (s \ s 4 * * 7 ) fl (J £ ' = 0. 

4 We say that a set A is almost contained in B and write A C* B if A \ B 

is finite. Let B C [u} u be such that f) Bo is infinite for every finite Bo C B. 

If MA holds and \B\ < c show that there exists an A £ [ lo\ u such that 

A C* B for every B £ B. 
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5 Assume MA and let B be a family of almost-disjoint subsets of to such 
that \B\ < c. Show that for every A £ [B]- u there exists adCw such that 
a C * d for every a £ A and d f! b is finite for every b £ B \ A. Hint: Let 
A* = {a\n: n < u> & a £ A} and define P = [A*] <aJ x [Z?\A] <a; , ordered by 
(A, B) < (A', B ') if and only if A D A', 5 D B\ and (J(A\ A') n (JB' = 0. 
Use the forcing P to define the set d as the union of all sets (J A with (A, B) 
from an appropriate generic filter in P. 

6 (Challenging) Assume MA and let n < c be an uncountable regular 
cardinal. Show that for every family A of countable subsets of k such that 
|A| < c there exists a B £ [k] k such that A n B is finite for every A £ A. 
Hint: Use a forcing similar to that from Exercise 3. Use the A-system 
lemma to prove that it is ccc. 

7 (Challenging) Generalize Exercise 6 as follows. Assume MA and let 
k < c be an uncountable regular cardinal. Show that for every family A 
of countable subsets of k such that |A| < c there exists a countable cover 
{P n : n < uj} of k such that A D P n is finite for every A £ A and n < u>. 
Hint: If (P, A) is the partially ordered set used in Exercise 6, use the forcing 
P* = Func w (w,P) ordered by s < t if dom(s) D dom(f) and s(n) A t(n ) for 
every n € dom(f). 


8.3 Suslin hypothesis and diamond principle 

Let ( X , <) be a linear order. For a, b £ X, an open interval with endpoints 
a and b is defined in a natural way as (a,b) = {x £ X : a < x < 6}. A 
linearly ordered set X is ccc provided every family of pairwise-disjoint open 
intervals in X is at most countable or, equivalently, when the partial order 
(r\ {0}, C) is ccc, where r is an order topology on X, that is, the topology 
generated by the family of all open intervals in X. 

In this terminology, Theorem 6.2. l(i) says that (R, <) is ccc. 

Now let us turn our attention to Theorem 8.1.4. It says that a linearly 
ordered set (X, <) is isomorphic to (M, <) if and only if it has the following 
properties. 

(a) X has neither a first nor a last element, and is dense in itself. 

(b) X is complete. 

(c) X contains a countable dense subset. 

In 1920 Suslin asked whether, in the preceding characterization, condi- 
tion (c) can be replaced by the weaker condition 

(c') X is ccc. 
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The Suslin hypothesis SH is the statement that such a replacement in- 
deed can be made, that is, that any ccc linearly ordered set ( X , <) satisfying 
(a) and (b) contains a countable dense subset. 

In what follows we will use the term Suslin line for a ccc linearly ordered 
set ( X , <) satisfying (a) that does not contain a countable dense subset. 
Thus a complete Suslin line is a counterexample for the Suslin hypothesis. 
In Exercise 1 we sketch the proof that the existence of a Suslin line im- 
plies the existence of a complete Suslin line. Thus the Suslin hypothesis is 
equivalent to the nonexistence of a Suslin line. 3 

In what follows we will show that the Suslin hypothesis is independent 
of the ZFC axioms. We start with the following theorem. 

Theorem 8.3.1 MA+^CH implies the Suslin hypothesis. 

Proof Assume MA+->CH and, to obtain a contradiction, that there ex- 
ists a Suslin line ( X , <). Then the partially ordered set (P, <) = {J, c) is 
ccc, where J is the family of all nonempty open intervals in (X, <}. We 
will show that the product forcing P x P is not ccc, contradicting Corol- 
lary 8.2.10. 

For this, we construct by induction a sequence ((a^,b^, C{) € X 3 : £ < u\) 
such that for every £ < u>i 

(i) aj < < c^, and 

(ii) (a 4 ,c f ) D {5 C : C < C} = 0. 

Such a construction can be easily made, since for every £ < uq the countable 
set {6^: £ < £} cannot be dense, so there must exist a £ < c $ in X such 
that (a£,C{) n { b £ < £} = 0. Thus such an element b £ can be chosen, 
since X is dense in itself. 

Now let t/j = ((a$ , &{), (&£, C{)) € P x P. It is enough to show that 
and Ug are incompatible in P x P for every ( < £ < u-i. This is the case 
since by (ii) either b( < a^, in which case (a^, b^) fl (a^, b j) = 0, or b q > cj, 
in which case fl (&£, cj) = 0. □ 

The remainder of this section is devoted to the proof that the existence 
of a Suslin line is consistent with ZFC. 

Notice that in the proof of Theorem 8.3.1 we used MA+->CH only to 
conclude that the product of ccc forcings is ccc, from which we deduced 

3 In the literature it is probably more common to find the term “Suslin line” used for a 
ccc linearly ordered set that, considered as a topological space with the order topology, 
does not have a countable dense subset. However, every such ordering can be extended 
to a Suslin line in the sense defined here (see Exercise 2). Thus, independently of which 
meaning of Suslin line is used, the Suslin hypothesis is equivalent to the nonexistence of 
a Suslin line. 
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that there is no Suslin line. Moreover, from Theorem 8.2.12 we know that 
CH implies the existence of ccc forcings whose product is not ccc. Can we 
generalize this argument to construct a Suslin line ( X , <) under CH? 

In what follows we will describe some difficulties that await any at- 
tempted inductive construction of a Suslin line. Then we will show the 
way to overcome these problems. In particular, the next lemma will be 
used to achieve both of these goals. It will illustrate the aforementioned 
difficulties, and will be used in our construction of a Suslin line. To state 
the lemma, we need the following notation, which will be used for the 
remainder of this section. 

For a < u>\ the symbol A Q will denote the ath limit ordinal number 
in u>i + 1. Thus {A Q : a < 07} is a one-to-one increasing enumeration of 
the set of all limit ordinals in 17 + 1. In particular, Ao = 0, Ai = lu, 
X2 = u> + u >, . . . , and 

A Wl = u>i and A a = | J A/3 for every limit ordinal a < u>\. 

fi<a 

Also, for a < aji let Q q = {A a + n: n < u>} and let < Q be a linear-order 
relation on Q q giving it the order type of (Q, <). Notice that A a +i = 
A a U Q a for every a < u> 1 and that 

a < A q = [J{Qc : C < 0} for every a < 07. 

Lemma 8.3.2 Let a < u>±. If ( Sp C A/ 3 : (3 < a) and (-<pC ApxAp:(3 < a) 
are such that for every (3 < 7 < a 

(1) ^0 is a linear-order relation on Ap; 

(2) Sp is a proper initial segment of (Ap, ~3p); 

(3) ^7= Ui< 7 (— s) if 7 is a limit ordinal ; 

(4) if 7 = £ + 1 then the relation ;< 7 on A 7 = Aj U Qj is defined from A £ 

by “sticking” the entire (Q^,<^) between S £ and A £ \ S^; that is, ^ 7 
extends ^ U <£ and is defined for every ( x , q) £ A^ x Qj by: x ^ 7 q 
if x £ Sj, and q ^ 7 x if x £ A{ \ ; 

then for every 7 < a 

(I) (A 7 , ^ 7 ) is a linearly ordered set that is dense in itself and has neither 

a largest nor a smallest element. 

Moreover, if a = 07 then (07, A Wl ) does not have a countable dense subset. 
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Proof Condition (I) is easily proved by induction on a < uq. Its proof is 
left as an exercise. 

To see that (uq,^^) does not have a countable dense subset take a 
countable set D C uq. Then there is a £ < uq such that D C Aj. Choose 
a,b € such that a b. Then, by (4), there is no d £ D such that 
a < Ul d -< Ul b. So D is not dense. □ 

The linear order (uq, constructed as in Lemma 8.3.2 has already 
all the properties of a Suslin line, except possibly that of being ccc. Thus 
the question is, Can we make it ccc? 

Clearly, we have the freedom of choice of initial segments S a . Such a 
choice can “kill” a potential uncountable antichain ((«£,&<;): £ < uq) by 
making sure that, starting from some a < uq, all Qj for £ > a will be 
placed inside some (a^, b^) for f < a. This would imply that there are only 
countably many points of uq outside U<;<a( a 0^c)’ making it impossible 
for {{a^,b(f): ( < uq) to be an antichain. There is, however, a problem 
in carrying out such a construction using only CH, since we have an in- 
duction of only length u> i, and we have to “kill” 2 Ul potential antichains 
(( a^,b ^) : £ < uq). Thus once more we face the problem of having a “too 
short induction.” Moreover, Theorem 8.3.1 shows that Martin’s axiom 
can’t help us this time. Someone may still have a hope that, through some 
trick, we can reduce the number of steps necessary to rescue this construc- 
tion. Indeed, this is what we will do. However, such a “trick” can’t be 
found with the help of CH alone, as it is known that the existence of a 
Suslin line cannot be concluded just from the continuum hypothesis. Thus 
we will need another “magic” axiom, which will show us the way out of our 
dilemma. To formulate it, we need some new definitions. 

A subset C of u q is closed if for every S C C its union [J S is in CU{uq} 
or, equivalently, when every limit ordinal A < uq is in C provided C fl A is 
unbounded in A. (In fact, being closed in uq is equivalent to being closed in 
the order topology of uq.) For example, a nonzero ordinal number a < uq 
is closed in uq (as a subset) if and only if it is a successor ordinal. 

In what follows we will be primarily interested in closed unbounded 
subsets of uq (see Section 5.3). Notice that these sets are quite big, as 
follows from the next proposition. 

Proposition 8.3.3 If C and D are closed, unbounded subsets of uq then 
C fl D is also closed and unbounded. 

Proof Clearly, C fl D is closed, since for every S C C fl D its union |J S 
is both in C U {uq} and in D U {uq}. 

To see that C fl D is unbounded in uq, pick a < uq and define by 
induction the sequences ( 7 „ £ C: n < lu) and (8 n £ D: n < u) such that 
< 7n < h n < 7 , 1+1 for every n < u>. It can be done since sets C and 
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D are unbounded in uJi. Then (3 = \J n<ul ln = U n < w < oj 1 belongs to 
C (1 D and is greater than a. □ 

One of the most important facts we will use about the closed unbounded 
sets is the following proposition. 

Proposition 8.3.4 Let T be a countable family of functions of the form 
f: iOi — > lo\ or f : w" — > [a>i]- w with n < u>. If 

D = {a < u >\ : a is closed under the action of IF} 

then D is closed and unbounded. 

Proof It is easy to see that D is closed (compare the proof of Lemma 6. 1.6) . 
To see that it is unbounded pick f3 < u>± and let i: w i — > [wi]-" be given 
by i(a) = a = {/3: (3 < a} € . Then, by Lemma 6.1.6(a) used with 

Z = (3, there is a countable subset Y of that is closed under the action 
of T U {!}. In particular, closure under the action of {!} implies that Y is 
an ordinal number. So Y is a countable ordinal number containing f3 and 
belonging to D. □ 

A subset S of is stationary if S fl C ^ 0 for every closed unbounded 
set C C wi. The theory of stationary sets is one of the most beautiful and 
useful parts of set theory. However, due to lack of space, we will not be 
able to develop it here in full. Some traces of its power can be found in the 
rest of this section and in the exercises. 

The additional axiom we will need is called the diamond principle and 
is usually denoted by the symbol <C>. It reads as follows. 

Diamond principle There exists a sequence (A a C u>\ : a < oq), 
known as a If -sequence, such that for every A Cwi the set 

{a < u>i : 4fla = A a } 


is stationary. 

Notice that 

Proposition 8.3.5 implies CH. 

Proof Let Acw. Since C = \ u is closed and unbounded, there exists 

an a £ C such that A a = An a = An u> = A. Thus { A a flw: a < u>i} = 

□ 

In the next chapter we will show that <C> is consistent with the ZFC 
axioms. It is also true that it does not follow from CH. Thus we have the 
following implications, none of which can be reversed: 


<C> => CH =>■ MA. 
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As we have just seen a <C>-sequence lists all subsets of uj. But it also 
captures a lot of information regarding uncountable subsets of uq. This will 
be enough to overcome the obstacles to the construction of a Suslin line 
described earlier. For the construction we need the following easy lemma. 

Lemma 8.3.6 If holds then there exists a sequence (B a Caxa: a< oq) 
such that for every B C oq x uq the set 

{a < u>i : B C\ (a x a) = B a } 


is stationary. 

Proof Let fo,fi- w i — > c<q be such that /: oq — > a>\ given by /(£) = 
</o(0./i(0) is a bijection and let B a = f[A a ] n (a x a). Notice that, 
by Proposition 8.3.4 used with the family T = {/o, /i, / _1 }’ the set D = 
{a: f[a\ = a x a} is closed and unbounded. 

To see that the sequence (B a : a < uq) has the desired properties, take 
B C lo i x and let C C uq be closed and unbounded. We must find an 
a € C such that B a = B n (a x a). 

Since C ft D is closed and unbounded and {a < uq : f~ 1 (B) fla = A a } 
is stationary, there is an a € Cf\D such that A a = f~ 1 (B)r la. But a € D 
implies that 

A a = f~ 1 (B) D a = f^ 1 {B) fl / _1 (a x a) = f^ 1 (B fl (a x a)). 
Hence B a = f[A a ] fl (a x a) = B fl (a x a). □ 

Theorem 8.3.7 If holds then there exists a Suslin line (X, A). In par- 
ticular, the Suslin hypothesis fails. 

Proof We will construct ( X , A) = (uq, ^ Wl ) as in Lemma 8.3.2, following 
the idea described after its proof. The sequence (B a C a x a: a < uq) 
from Lemma 8.3.6 will be used as an “oracle” that will tell us how to choose 
initial segments S a so as to “kill” all potential uncountable antichains. 

Thus we will construct inductively the sequences (S a C A a : a < uq) 
and (A a C A a x A„ : q < uq) such that for every /3 < a < uq 

(1) is a linear-order relation on A Q ; 

(2) S /3 is a proper initial segment of (\p, A^); 

(3) A a = U/3< a (— p) if ct is a limit ordinal; 

(4) if a = £ + 1 then < a extends U <{ and for every (x, q) € A^ x Qj 

is defined by: x A a q if x £ S$, and q x if x £ \ S £. 
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Note that by Lemma 8.3.2 we know that such a construction can be 
made and that ( X , ■<) obtained this way is linearly ordered, is dense in 
itself, has neither a largest nor a smallest element, and does not have a 
countable dense subset. So it is enough to show that we can choose sets 
S a such that ( X , <) will be ccc. 

So assume that for some a < the sequences (Sp C A p: (3 < a) and 
{■<pC. Ap x Ap: (3 < a) are already constructed. We must construct the 
proper initial segment S a of A a . It will be done as follows. 

For a,b £ A a let (a, b) a be an open interval in (A a , ^ a ) with the end- 
points a and 6, that is, 

(a, b) a = {i€A a : a -< a x -< a b }, 


and for £ < a let 

q = {(a,b) a : (a, 6) £B c }. 

Notice that C £ x £ C A a x A a , so is well defined. Define the family 
T a as 

{C“ : £ < a and C“ is a maximal family of disjoint intervals in (A a , ^«)} 

and let {£ n : n < w} = T a U {C}, where C is an arbitrary maximal family 
of pairwise-disjoint intervals in (A a , ^). (We add C to this list just to avoid 
a problem when T a = 0.) 

The idea of the proof is that the elements of U a <wi approxi- 

mations of all the potential families of pairwise-disjoint intervals in (X, <). 
We will make sure that every family C £ T a remains maximal after step 
a by “sticking” every Qp for a < [3 < u>\ inside some interval I £ C, thus 
adding no new elements of X outside (J C. 

So define, by induction on n < oj, a sequence ((a n , b n ) £ A a xA a : n < u) 
such that for every n < u> 

(A) a -< a a n -< a b n -< a b for some (a, b) a £ £ n \ and 

(B) a n -\ a n -< a b n -< a b n - 1 for n > 0. 

Such a construction can be made since, by Lemma 8.3.2, (A a , A a ) is dense 
in itself and, by the maximality of £ n , the interval (a„_i, b n -i) a must 
intersect some (a, b) a £ £ n . We define S a = {x £ A a : 3n < to (x a n )}. 
This finishes the construction. 

Now notice that if C“ £ T a for some £ < a < u>\, then there are 
n < u> and a, 6 £ A a such that a -< a a n -< a b n -< a b, ( a,b) a £ C“, and 
a n £ S a but b n tfL S a . In particular, Q a C (a n ,6 n ) a +i C an< i C^ +1 

remains a maximal family of pairwise-disjoint intervals in (A a +i, A a+ i). So 
C “ +1 e F a +\. 
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Using this, we can easily prove by induction on (3 < u>\ that for every 
a < u>i 

if C“ G T a and a < (3 < u>i then U and d G Tp. 

a<7</3 

This, in particular, implies that U a < 7 < Wl Q 7 C IJC ^! 1 provided C“ G T a , 
that is, that 

C ^ 1 is a maximal family of disjoint intervals in ( X , A), if G T a . (8.24) 

To see that ( X , A) is ccc assume, to obtain a contradiction, that there is 
an uncountable family Aq of nonempty pairwise-disjoint intervals in (X, A). 
Using Zorn’s lemma we can extend Ao to a maximal family A of nonempty 
pairwise-disjoint intervals in (X, A)- Put B = {( a,b ): ( a,b) UJl G A}. The 
maximality of A means that for every a, b G X, with a -< 6 , there exists 
a nonempty interval (l, r) LJl = (l(a, b), r(a, b)) Ul G A intersecting (a, b) Ul . 
Let l,r: u>\ — > be functions having the foregoing property and let 
A: tui — » tui be given by A(£) = + 1. Put 

D = {a < loi : a is closed under the action of {A, l, r}}. 

By Proposition 8.3.4, D is closed and unbounded in u>i. Moreover, for 
every aGO, 

X a = a, (8.25) 

and 

T^a = {( l , r ) a : (l,r)Gfin(ax a)} (8.26) 

is a maximal family of pairwise-disjoint intervals in (X a , -< a ) = (a, ^ a ). 

Condition (8.25) follows from the fact that a is closed under the action 
of A. To see why, notice first that a is a limit ordinal, since ^ + 1<A^ + 1 = 
A(£) < a for every £ < a. Now A a = Uf< a \ — a — 

Condition (8.26) follows from the closure of a under the actions of l and 
r, since for every a, b G a = X a , with a -< a b , there are l = l(a, b) < a and 
r = r(a,b) < a with (l,r) Wl G A intersecting (a, 6 ) Wl . This clearly implies 
that ( l,r) a G V a intersects (a, b) a , making V a maximal in (X a ,^ a ) = 
Aa)- 

By the choice of the sequence {B a C a x a: a < u>\) there exists an 
a G D such that B n (a x a) = B a . Hence, by (8.25) and (8.26), 

= {{hr) a : ( l,r ) G B a } = {( l,r) a : ( l,r ) GBn(ax«)} = D„ 

is a maximal family of pairwise-disjoint intervals in (ct, Aa) = (A a ,Aa)- 
Hence, by the definition of T a , C“ G T a . So, by (8.24), C ^ 1 is a maximal 
antichain in (X, A)> contradicting the fact that is a proper subset of 
an uncountable antichain A. Thus we have proved that (X, A) is ccc. □ 
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EXERCISES 


1 Let ( X , <) be a Suslin line and let X* be the family of all proper 
nonempty initial segments of X without last elements. Prove that (X*, c) 
is a complete Suslin line. 


2 Let (X, <) be a ccc linearly ordered set and assume that X, considered 
as a topological space with the order topology, does not have a countable 
dense subset. Show that there exists an X 0 C X such that (X 0 ,<) is a 
Suslin line and for every a,b £ Xq, with a < b, there is no countable dense 
subset of (a,b) fl X 0 . Hint: Define an equivalence relation ~ on X by 
putting x ~ y if and only if the interval between them contains a countable 
subset that is dense in it with respect to the order topology. Choose X 0 to 
be any selector from the family of all equivalence classes of from which 
the least and the greatest elements are removed, if they exist. 


3 Complete the proof of Lemma 8.3.2 by proving condition (I). 

4 Prove that if sets C n C uq are closed and unbounded then H n <w ( -' n 
closed and unbounded. 


5 Let {C a C uq: a < u q} be a family of closed unbounded sets. Prove 
that the set 


D = {7 < uq : 7 G C a for all a < 7}, 

known as the diagonal intersection of the sets C a , is closed and unbounded. 


6 Prove the following theorem, known as the pressing-down lemma or 
Fodor’s theorem: 

Let S C uq be a stationary set and let / : S — > uq be such that 
/( 7) < 7 for every 7 £ S (such a function is called a regressive 
function). Then there exists an a < uq such that f~ 1 ({a}) is 
stationary. 

Hint: Otherwise for every a there exists a closed and unbounded set C a 
disjoint from / -1 ({a}). Consider the diagonal intersection D of the sets 
C a and show that it is disjoint from S. 
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7 A tree is a partially ordered set (T, <) with a smallest element such that 
the relation < is well founded, that is, that every initial segment of T is 
well ordered. A Suslin tree is an uncountable tree that, considered as a 
forcing with the reversed order (T, <), has neither uncountable chains nor 
uncountable antichains. Show the existence of a Suslin line implies the 
existence of a Suslin tree. Hint: Start with a Suslin line (Xo, <) as in 
Exercise 2. Define, by induction on £ < uq, a sequence (T^: £ < uq) such 
that To = Xq and each for £ > 0 is a maximal family of pairwise-disjoint 
intervals in Xq, none of which contain any endpoint of any interval from 

Then T = U{< Wl ordered by reverse inclusion D is a Suslin 

tree. 

8 Show that the existence of a Suslin tree (T, <) implies the existence of a 
Suslin line. (Thus the existence of a Suslin line is equivalent to the existence 
of a Suslin tree.) Hint: Let A be any linear order on T and let X be the 
family of all strictly increasing functions / from an ordinal number a into 
(T, <) such that f[a] is a maximal chain in T (such an / is called a branch 
of T). For different f,g € X define / < g if /(/?) A g(/3), where /? is the 
smallest ordinal for which f(/3) yf g{/3). Show that (X,<) is a ccc linearly 
ordered set such that X, considered as a topological space with the order 
topology, does not have a countable dense subset. Then use Exercise 2. 

9 For a tree (T, <} and an ordinal number a, the a-level of T is the set 
of all t G T for which the initial segment generated by t has order type a. 
(Thus any level of T is an antichain.) An Aronszajn tree is an uncountable 
tree with no uncountable chains and all levels countable. (Thus every Suslin 
tree is an Aronszajn tree.) Show, in ZFC, that there exists an Aronszajn 
tree. Hint: For /, g: a w write / =* g if the set {/3 < a: f{(3) yf g((3)} is 
finite. Define, by induction on a < aq, a sequence of one-to-one functions 
(s Q G oj a : a < u>i) such that u> \ range(So,) is infinite and s a |/3 =* sp for 
every j3 < a. Define T = {s G ui a : a < uj\ & s =* s a } and relation < as 
reverse inclusion D. 



Chapter 9 

Forcing 


In this chapter we will describe a technique for proving that some set- 
theoretic statements are independent of the ZFC axioms. This technique 
is known as the forcing method. We will not prove here all the theorems 
needed to justify this method. (Sketches of some of the missing proofs are 
included in Appendix B. The complete proofs can be found, for example, 
in Kunen (1980).) Instead, we will describe only the essentials for its use 
and concentrate on its applications. 1 


9.1 Elements of logic and other forcing pre- 
liminaries 

We will start here with some definitions, which will serve as technical tools 
to develop the forcing method. 

A set M is said to be transitive if x C M for every x £ M, that is, if 
a £ x and x £ M imply that a £ M. 

Lemma 9.1.1 For every set x there exists a smallest transitive set trcl(:r) 
such that x C trcl(x). 

Proof Define, by induction on n < u>, 

Uq{x) = x and U n +i(x) = 1J U n {x) for n < oj. 

1 The material included in Appendix B is not essential for the applicability of the forcing 
method and can be completely skipped. However, those interested in reading it should 
consider waiting at least until the end of this section, since the material included here 
should make Appendix B easier to follow. 
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Then trcl(x) = \J n<UJ U n (x) is the desired set. It is transitive, since for 
every y £ trcl(a;) there is an n < u> such that y € U n {x). Then y C 
U U„(x) = U n+ i(x) C trcl(a;). 

To prove the minimality of trcl(x) it is enough to take a transitive set 
M with x C M and show, by induction on n < u>, that U n C M for every 
n < u>. The details are left as an exercise. □ 

The set trcl(x) from Lemma 9.1.1 is called the transitive closure of x. 
For an ordinal number a define inductively a sequence (R(/3) '■ (3 < a) 
by putting 

(a) R( 0) = 0, 

(b) R{(3 + 1) = V{R{f3)) for (3 < a, and 

(c) R( A) = U/3<a R(P) for every limit ordinal A < a. 

Note that the sequence (R(/3) : (3 < a) is increasing, that is, that 

R(/3) C I?(q) for every ordinal numbers 7 < (3 < a. ( 9 - 1 ) 

An easy inductive proof of this fact is left as an exercise. 

Lemma 9.1.2 For every set x there exists an ordinal number a such that 
x £ R(a). 

Proof First notice that for any set x, 

(★) if for every y £ x there exists an ordinal number a such that y £ R(a), 
then x £ R(a o) for some ordinal «o- 

To see this, let r be a function such that r(y) = min{/3: y £ R{(3 + 1)} 
for every y £ x. Such a function exists by the axiom of replacement. 
Let 7 = U y ex r (y)- Then 7 is an ordinal number and x C f?( 7). So 
x £ V(R( / y)) = f ?(7 + 1). 

Now, to obtain a contradiction, assume that there exists an x such that 
x R(a) for every ordinal number a. So, by (*), there exists a y £ x such 
that y R(a) for every ordinal number a. In particular, the set 

A = {y £ trcl(a:) : y fL R(a) for every ordinal number a} 

is not empty. So, by the axiom of regularity, A contains an G-minimal 
element, that is, there exists an xq £ A such that if l xo = 0. But Xq £ 
A C trcl(x). Hence xo C trcl(x), since trcl(x) is transitive. Therefore xo 
satisfies the assumption of (*) so Xo £ R(a 0 ) for some ordinal number «o- 
But this contradicts xq £ A. □ 
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For a set x define the rank of x by 

rank(x) = min{/3: x G R{(3 +1)}. 

Notice that by Lemma 9.1.2 rank(x) is defined for every set x. 

Lemma 9.1.3 

(a) rank(x) < a if and only if x € R(a). 

(b) If x € y then rank(x) < rank(y). 

Proof (a) This is obvious by (9.1) and the definition of rank(x). 

(b) If P = rank(y) then y G i?(/3 + 1) = V(R{fd)). So x G y C R{(3). 
Therefore rank(x) < (3 = rank(y). □ 

Before we describe the forcing method, we will reexamine some notions 
discussed in Sections 1.1 and 1.2. This should give us a better under- 
standing of what we mean when we say that a property is “independent of 
ZFC.” 

In what follows the term “formula” will always be understood as a 
formula of the language of set theory, that is, the language described in 
Section 1.2. In particular, the set of all formulas is defined by induction on 
their length from only two basic kinds of formula: “ x G y ” and “x = y ,” 
where symbols x and y represent variables. A more complicated formula 
can be built from the less complicated formulas p and ip only by connecting 
them with logical connectors, p&ip, p V ip, p—>ip, p<->ip, or by preceding 
one of them with the negation, —up, or a quantifier, 3xp, \/xp. For example, 
u x = y & 3x \/y ( x G y)” is a correct formula, whereas expressions such 
as u x G y G z" or 11 z = f V c” are not. (Although “iGyG z” could be 
interpreted as “x G y & y G zf the latter being a correct formula.) 

A variable in a formula is any symbol that represents a set. More 
precisely, it is any symbol that is neither of the following: “G,” “=,” 
“V “V,” “3,” “V,” “(,” or “).” In a given formula 

not all variables must be alike. For example, in the formula p defined as 
“x = y & 3 z (z = z)” the variables x and y are parameters and p might 
be either true or false, depending on what value we associate with x and 
y. On the other hand, the variable 2 : lies within the scope of the quantifier 
“3” and is not a parameter. 

The variables of a formula p that are within the scope of a quantifier 
are said to be bound in p. The parameters of a formula will be referred to 
as free variables. Notice that the same symbol in the same formula may be 
used both as a bound variable and as a free variable. For example, this is 
the case in the formula “3x (x G y) & x = yf where the first x is bound, 
while the second plays the role of a free variable. (Notice the importance 
of the distribution of parentheses.) 



9.1 Elements of logic and other forcing preliminaries 


167 


Formulas without free variables are called sentences. Since all variables 
in a sentence are bound, no interpretation of free variables is necessary to 
decide whether it is true or false. For this reason only sentences will be used 
as axioms. On the other hand, if aq, . . . ,x n are free variables of a formula 
p, we often write p{x i, . . . ,x n ) instead of p to emphasize its dependence 
on xi, . . . ,x n . In particular, we will usually write 3 xp(x) and \/xp(x) in 
place of 3xp and \/xp. 

By a theory we will mean any set of sentences. The sentences belonging 
to a theory T will be treated as its axioms. 2 In particular, ZFC is a theory, 
and it consists of infinitely many axioms since each of the scheme axioms 
(comprehension or replacement) stands for infinitely many axioms. 

We say that a sentence p is a consequence of a theory T (or that p can 
be proved in a theory T) and write it as T b p if there is a formal proof of 
p using as axioms only sentences from T. It is also equivalent to the fact 
that there exists a finite set To = {ipo , ... , ip n } of axioms from T such that 
(t/>o& ' ’ ’ &,ipn)—*P is a consequence of axioms of logic. (See Appendix B for 
more details.) 

A theory T is said to be inconsistent if there is a sentence p such that 
T b (p&z-ip), that is, if it leads to a contradiction. Equivalently, T is incon- 
sistent if every sentence ip is a consequence of T. (The equivalence follows 
from the fact that the implication (p&z-ip)—*^ is true for all formulas p 
and ip.) Conversely, we say that a theory T is consistent and write Con(T) 
if T is not inconsistent, that is, if T does not imply a contradiction. A 
sentence “ip” is consistent with theory T if T+“ip ” is consistent. 

Evidently, from the point of view that theories should carry useful in- 
formation, only consistent ones are interesting. In particular, we will be 
assuming here that ZFC is consistent. (Recall that by the Godel’s second 
incompleteness theorem, Theorem 1.1.1, there is no hope of proving the 
consistency of ZFC in itself as long as it is really consistent.) 

The relation between the notions of “being a consequence of a theory” 
and “being consistent with a theory” is best captured by the following fact, 
whose easy proof is left as an exercise: 

T Y- p if and only if Con(T + “-ip”). (9-2) 

Recall also that a sentence p is independent of theory T if neither T h p 
nor T b -ip or, equivalently, when both theories T + “p” and T + “—>p” 
are consistent. Thus Godel’s first incompleteness theorem, Theorem 1.1.2, 
implies that there are sentences of set theory that are independent of ZFC. 

2 We assume that theory contains only sentences, since it is more difficult to accept a 
formula with some free variables as an axiom. (Intuitively whether it is true or false can 
depend on the value of the free variables.) However, logically any formula ip is equivalent 
to a sentence Vau • • • \/x n <P, where { x i . . . . , x n } is a list of all free variables in ip. So our 
restriction is not essential. 



168 


9 Forcing 


The forcing method will be a tool to show that a given sentence ip is 
consistent with ZFC. So, in order to prove that a sentence ip is independent 
of ZFC, we will be proving that the theories ZFC+ “ip” and ZFC+“-ii/;” are 
both consistent. 


EXERCISES 

1 Complete the details of the proof of Lemma 9.1.1 by showing the mini- 
mality of trcl(a:). 

2 Prove (9.1). 

3 Prove (9.2). Hint: Use the fact that (T U {<£>}) b ip if and only if 
T b (<£->-0). 


9.2 Forcing method and a model for -iCH 

Consider a transitive set M. For every formula ip of the language of set 
theory (without any shortcuts) we define a formula ip M , called its relativiza- 
tion to M . It is obtained by replacing in ip each unbounded quantifier \/x 
or 3x with its bounded counterpart \/x € M or 3a; £ M. For example, if 
ipo is a sentence from the axiom of extensionality 

\/x \/y \Mz (z £ a £ y)—>x = y] 

then ipQ 1 stands for 

\/x £ M My £ M \Mz £ M (z £ a :<-*z £ y)—>x = y] . 

In particular, if ip{x i, . . . ,x n ) is a formula with free variables Xi, . .. ,x n 
and t\,... ,t n £ M then ip M (t±,... ,t n ) says that ip(t\,... ,t n ) is true 
under the interpretation that all variables under quantifiers are bound to 
M. In other words, ip M (ti , ... ,t n ) represents the formula ip(ti , ... ,t n ) as 
seen by a “person living inside M” that is, thinking that M represents the 
entire class of all sets. (This is the best way to think of ip M .) 

For a transitive set M and a formula ip (with possible parameters from 
M ) we say that “ip is true in M” and write M \= ip if ip M is true. For a 
theory T we say that “T is true in M” or that “M is a model for T” if 
ip M holds for every ip from T. 

Note that a model for a theory T is just a set satisfying some properties. 3 
Thus, if M is the empty set, then M is a model for the theory T consisting 

3 For those readers who have been exposed previously to any kind of model theory we 
give here a bit of an explanation. We do not need to interpret any constants of set 
theory in M, since the language of set theory does not contain any constants. And the 
only relations that we have to take care of are the relation symbols “E” and which 
are interpreted as the real relations of “being an element of” and “being equal to.” 
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of all the axioms of ZFC except the set existence and the infinity axioms. 
This is so since any other axiom starts with a general quantifier \/x, and its 
relativization to M = 0 starts with \/x € 0, that is, is satisfied vacuously. 
Similarly, if M is any transitive set, then the axiom of extensionality is 
satisfied in M, since for any set x £ M 

z £ x if and only if z £ M & z £ x. 

It is also relatively easy to see that M = R(u)) is a model for ZFC minus 
the infinity axiom (see Exercise 1). 

Forcing consistency proofs will be based on the following fundamental 
principle. 

Forcing principle In order to prove that the consistency of ZFC implies 
the consistency of ZFC+“V’” it is enough to show (in ZFC) that 

(F) every countable transitive model M of ZFC can be extended to 
a countable transitive model N of ZFC+ “if . ” 

We use here the word “extend” in the sense of inclusion, that is, N extends 
M means that M C N. 

In what follows we will use the letters CTM as an abbreviation for 
“countable transitive model.” Thus the forcing principle asserts that if we 
can prove in ZFC the implication 

[3M(M is a CTM for ZFC)] =^[3 N(M C N & N is a CTM for ZFC+“V>”)] 

then we can conclude from it that Con (ZFC) implies Con {ZFC + u ip”). 

We will not prove the forcing principle here, since its proof is not im- 
portant for the applications of the forcing method. However, a sketch of 
its proof can be found in Appendix B. 

To use the forcing principle we will assume that we have a countable 
transitive model M of ZFC and then will extend it to an appropriate 
model N. To describe the extension method we need a few more defi- 
nitions. Let M be an arbitrary family of sets and let P be a partially 
ordered set. A filter G in P is M -generic if D fl G ^ 0 for every dense 
subset D of P that belongs to M. Notice that the Rasiowa Sikorski lemma 
(Theorem 8.1.2) immediately implies the following lemma since the family 
{D £M:flcP&Bis dense in P} is countable for any countable set M. 

Lemma 9.2.1 For every partially ordered set P, p £ P, and countable set 
M there exists an M -generic filter G in P such that p £ G. 

The model N from (F) will be constructed by using the following the- 


orem. 
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Theorem 9.2.2 For any countable transitive model M of ZFC, partially 
ordered set P = (Po,<) G M, and M-generic filter G in P there exists 
a smallest countable transitive model N of ZFC such that M C N and 
GgN. 

The model N from Theorem 9.2.2 is usually denoted by M[G\ and is 
called a generic extension of M. We will also refer to M as the ground 
model of M[G\. 

Before we present a construction of M\G\ we will have a closer look at 
Theorem 9.2.2. To make good use of it we must expect that a countable 
transitive model M for some property if (such as CH) can be extended to a 
model M\G] of its negation -i if. For this, M[G\ must be a proper extension 
of M. However, Theorem 9.2.2 does not rule out that M[G\ = M. In fact, 
if G £ M then clearly M[G] = M. Fortunately, the extension will be proper 
for a large class of partially ordered sets described in Proposition 9.2.3. 

Proposition 9.2.3 Let M be a transitive model of ZFC and let P € M be 
a partially ordered set such that 

(*) for every p € P there are two incompatible q, r £ P below p. 

If G is an M-generic filter in P then G e/L M . 

Proof Notice that (*) implies that D = P\ G is dense in P. Now if D £ M 
then, by the definition of an M-generic filter, we would have G n D ^ 0, 
which is impossible, since GC\D = Gfl(P\G) = 0. So D £ M. But P £ M 
and M is closed under taking the difference of two sets, since it is a model 
of ZFC. Therefore G M, since otherwise D = P \ G £ M. □ 

Without any doubt the reader should notice a similarity between Mar- 
tin’s axiom and the statement of Theorem 9.2.2. In fact, the generic filter 
G from the theorem will be used in M[G\ in a way similar to the way we 
used generic filters in the proofs where Martin’s axiom was used. To see the 
similarities as well as differences between such uses in both cases consider 
the following example. 

Example 9.1 Let M be a countable transitive model of ZFC. For sets X 
and Y from M consider the forcing P(X, Y) = (Func w (X, Y), D). Then 
P(X, Y) belongs to M, being defined from X, Y £ M. It is easy to see that 
if X is infinite then the sets 

D x = {p £ P(-X", Y) : x € dom(p)} for x € X 

and 

R y = {p £ P(X, Y): y £ range(p)} for y £ Y 
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are dense in P(X, Y). They belong to M, since each of them is defined 
from P(X, Y) and either x G X C M or y G Y C M. In particular, if G 
is an M-generic filter in P(X, Y) then, by Proposition 8.1.1, g = |J G is a 
function from X onto Y . 

In the proof that -^CH is consistent with ZFC we will use the forcing 
Po = P(w 2 x ai, 2). Then, for an M-generic filter Go in Po, we see that 
g = U Go : 0 J 2 x to — > 2 belongs to M[Go]. Moreover, notice that for every 
Q < £ < u >2 the sets 

E\ = {p G P 0 : 3n < ui [(C, n), (£, n) G dom(p) & p( (, n) ^ p(£, n)]} 

are dense in Po and belong to M. Thus an M-generic filter intersects every 
such set, that is, 

for every £ < £ < w 2 there exists anrKu such that g((, n ) ^ g(£, n). (9.3) 

Now for every ( < w 2 we can define a function : to — > 2 by g^(n) = g(t;, n). 
The set {g^: £ < w 2 } belongs to M[G], since it has been defined only 
from g G M[G]. Moreover, by (9.3), all functions g j are distinct and 
£ < ^ 2 } C 2 W . So we have proved that (in M[G 0 ]) the family 2“ 
contains a subset {55: ^ < w 2 } of cardinality w 2 , so c = |2“| > w 2 > u>i. 
That is, in M[Go] the continuum hypothesis is false. 

Is this really all we need to prove? Is it that simple? Unfortunately the 
preceding argument contains several imprecisions and gaps. To see this let 
us consider another forcing P* = P(w,wi) = (Func a ,(w, u>i), D) and let G* 
be an Af-generic filter in P*. Then, as we noticed before, g = (J G* is a 
function from ui onto lui ! This looks like a clear contradiction. What is 
wrong? Is Theorem 9.2.2 false? 

The answer is that there is nothing wrong with Theorem 9.2.2. What 
is wrong with our “contradiction” is that there is a misconception of what 
the theorem really says. It tells us that M[G] is a model for ZFC as long 
as M is such a model. In particular, ip M and ip M ^ G 1 are true for every 
sentence ip that is a consequence of ZFC axioms. However, the constants 
are not a part of the formal language of set theory, and we use them only 
as shortcuts for the formulas representing them. Thus we write “ x is equal 
to u >1” to express the fact that x is the unique set satisfying the formula 
“x is the smallest uncountable ordinal number.” Theorem 9.2.2 
tells us that the sentence if defined by 3U p(x) is true both in M and 
in M[G], since the existence of a unique u>i can be proved in ZFC. Thus 
ip N holds for N = M or N = M[G], and there exists a unique ui^ G N 
such that ip N {uji) is true, that is, that uii is an “uq with respect to N .” 
However, Theorem 9.2.2 does not tell us that uif 1 = In fact, since 

M is countable and transitive, every element of M is countable. Thus by 
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manipulating M from outside we may be able to take any set x £ M that 
is uncountable in M and find a generic extension M[G] of M in which x 
is countable. And this is precisely what happened in M[G*]\ The ordinal 
number co^ 1 became a countable object in M[G*], and is not equal to 
^m[g ]. ar g Umen t that indeed caf 1 ^ ^ will be completed after 

Lemma 9.2.5.) 

In fact, so far we are not even sure whether what is an ordinal number 
in M remains so in M[G\, or even whether what is 2 in M remains 2 in 
M[G\. To address this issue we have to examine our set-theoretic vocab- 
ulary more carefully. In particular, the general question we would like to 
examine is, Which properties of elements of M are preserved in its exten- 
sion M[G}7 (This can be viewed as an analog of the preservation problems 
for transfinite-induction constructions discussed in Section 6.1.) 

To address this issue recall once more that every set-theoretic term we 
use can be expressed by a formula of set theory. So the question may be 
rephrased as follows: 

For which properties P does there exist a formula ip{x i, . . . , x n ) 
describing P such that if ippti , . . . ,t n ) is true in M for some 
ti, . . . ,t n € M then ip(ti, ... ,t n ) is also true in M[G]1 

The properties that are always preserved in such extensions will be called 
absolute properties. More precisely, a property is absolute provided it can 
be expressed by a formula ip(x\,... ,x n ) such that for every transitive 
models M and N of ZFC with M C N and for every t\,...,t n £ M 
formula ip M (ti , ... ,t n ) holds if and only if ip N (t \, ... , t n ) holds. 

The absoluteness of most properties we are interested in follows just 
from the fact that the models we consider are transitive. To identify a 
large class of such properties we need the following notion. A formula ip is 
a A 0 -formula if it can be written such that the only quantifiers it contains 
are bounded quantifiers, that is, quantifiers of the form 3a: £ y or \/x £ y. 

Lemma 9.2.4 Let M be a transitive set and let ip{ x\,... ,x n ) be a Ao- 
formula. Ifti, . . . ,t n £ M then ip M (t \, . . . , t n ) is equivalent to ip{t \, . . . , t n ) 
in the sense that the sentence 

\/M[M is transitive — > (Vti , ... ,t n £ M){if M (t \, ... , t n ) <-> ip(ti , . . . , t n ))} 
is provable in ZFC. In particular, the property expressed by ip is absolute. 

Proof The proof goes by induction on the length of formula ip. 

Clearly it is true for any basic formula x £ y or x = y, since for any 
quantifier-free formula ip its relativization ip M is identical to ip. 

So assume that formula ip is built from the less complicated formulas 
(fio and <p i, that is, ip is in one of the following forms: V (pi, 



9.2 Forcing method and a model for ~^CH 


173 


Po~ v Pii Po •<— >v?i , — ></?o 5 3s £ t po, or Va : £t po- For example, if ip = p\ V pi 
and the lemma is true for all formulas of length less than the length of ip, 
then it holds for po and p i. In particular, 

ip M = (ipo V Pi) M = Po 1 V p™ = po V pi = ip. 

We argue similarly if ip is the negation of a formula, or is obtained from 
two formulas by using the logical operations &, — >, or <->. 

If ip is of the form 3x £ t po(x ) for some t € M then its formal represen- 
tation is 3a: [x € t & po(x)\. Then ip M stands for 3a; € M [x € t & Pq 1 ( x)\, 
which is equivalent to (3a; € tC\M)p 0 (x) since po(x) is equivalent to P(f{x) 
by the inductive assumption. But t £ M and M is transitive. So t C M 
and tO M = t. Therefore ip M is equivalent to 3a; € t po{x), that is, to ip. 
The case when ip is of the form Va; £ t po(x) is similar. □ 

For example, the property “y is a union of x,” y = (J x, is absolute, 
since it can be written as the following Ao-formula ipi(x,y): 

\/w £ y3z £ x{w £ z) Sz\/z £ x\/w £ z(w £ y). 

Lemma 9.2.5 The following properties can be expressed by a A a -formida. 
In particular, they are absolute. 

(0) x C y. 

(1) y = (J x. 

(2) y = 0- 

(3) z = xUy. 

(4) z = x Piy. 

(5) z = x\y. 

(6) z={x,y}. 

(7) z is an unordered pair. 

(8) z = {x,y) (i.e., z = {{x},{x,y}}). 

(9) 2 is an ordered pair. 

(10) z = x x y. 

(11) r is a binary relation. 

(12) d is the domain of binary relation r. 
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(13) R is the range of binary relation r. 

(14) f is a function. 

(15) Function f is injective. 

(16) < is a partial-order relation on P (i.e., (P, <) is a partially ordered 
set). 

(17) D is a dense subset of the partially ordered set (P, <}. 

(18) A is an antichain in the partially ordered set (P, <). 

(19) Set x is transitive. 

(20) a is an ordinal number. 

(21) a is a limit ordinal number. 

(22) a is the first nonzero limit ordinal number (i.e., a = uj). 

(23) a is a finite ordinal number. 

(24) a is a successor ordinal number. 

Proof The Ao-formulas for these properties can be found in Appendix B. 

□ 

Now, for a property P expressed by a formula if(x), a transitive model 
M of ZFC, and t € M the statement “t has the property P” may be inter- 
preted in two different ways: as either that t satisfies the formula ip(x), that 
is, that ip(t) holds, or that ip(x) is satisfied by t in M, that is, that ip M (t) 
is true. In the latter case we will say that “t has the property P in M,” 
reserving the statement “f has the property P” for the former case. How- 
ever, for the absolute properties, such as “a is an ordinal number,” such 
a distinction is redundant. Thus, we will say “a is an ordinal number” 
independently of whether we think of a as an ordinal number in M or 
outside it. 

A similar distinction has to be made when we talk about constants. 
When we write u> we think of it as “w is the first infinite ordinal number.” In 
principle, an object u> M that satisfies this property in M might be different 
from the “real” w. However, for objects represented by absolute formulas 
this cannot happen. So in all these cases we will drop the superscript M 
from the symbol representing an absolute constant, even when we think of 
it as being “in M.” In particular, we will use the same symbol u> regardless 
of the context, since oj m = u. For the cases of the constants that are not 
absolute, such as wi or c, we will keep using the superscript M to denote 
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their counterparts in M. For example, we will write ujf r and c M to express 
oj\ and c in M . 

In fact, we have already used this convention when arguing that ^ 
with G* being an M-generic filter in P* = (Func a ,(w, uq), D). And 
now we can indeed see that lv^ 1 ^ \ This is so since and uj^ G ^ 

are (real) ordinal numbers, and in M[G*] there is a (real) function / from 
(true) u> onto So lu^ 1 is a countable ordinal in M[G*}, whereas ^ 

is an uncountable ordinal in M[G *] by its definition. Thus they cannot be 
equal. 

Proof of Theorem 9.2.2 To construct the set M[G] fix a countable tran- 
sitive model M of ZFC, a partially ordered set P G M, and an M-generic 
filter G in P. 

We say that r is a P-name provided r is a binary relation and 
V(cr,p) G t (a is a P-name & p € P). 

Formally, we define the property “r is a P-name” recursively by induction 
on rank(r). Thus r is a P-name if it satisfies the following recursive formula: 

there exists an ordinal number a and a function / such that dom(/) = 
a + 1, /(a) = {t}, and for every (3 G a + 1, t € /(/?), and zGf there 
exist 7 G (3, a G /( 7 ), and p G P such that z = ( a,p ). 

It is not difficult to see that if is absolute (see Exercise 2). So the property 
“t is a P-name” is absolute too. 

The set of all P-names that belong to M will be denoted by M p , that is, 
M p = {r G M : r is a P-name}. 

It can be proved that the class of all P-names is a proper class. Thus M p 
is a proper class in M. It particular, it does not belong to M. 

For a P-name r we define the valuation of r by 

val G (r) = {val G (cr): 3p G G ({ a,p ) G r)}. 

Once again, the formal definition of val G (r) should be given by a recursive 
formula. Now we define M[G\ by 

M[G\ = {val G (r) : r G M p }. 

This is the model from the statement of Theorem 9.2.2. 

We will not check here that all ZFC axioms are true in M[G], since it 
is tedious work and has little to do with the forcing applications. However, 
we will argue here for its other properties listed in Theorem 9.2.2. 
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Transitivity of M[G\ follows immediately from the definition of M[G\. 
Take y G M[G\. Thus y = val G (r) = {val G (er) : 3 p £ G (( a,p ) G r)} for 
some t G M p . So any x € y is in the form val G (cr) for some (a,p) G r G M. 
Thus (j is a P-name that belongs to M, and x = vale (o') G M[G\. 

The minimality of M[G] is clear, since every element of M[G } is of 
the form val G (r) and so must belong to any transitive model of ZFC that 
contains G and r. 

To see that M[G ] is countable notice first that it is a set by the axiom 
of replacement since vale is a function defined by a formula and M[G\ = 
vale [M p ] . It is countable, since M p C M is countable. 

To argue for M C M[G\ and G G M[G\ we need a few more definitions. 
For a set x define, by recursion, its P-name x by 

x = {{y , p) : y G x & p G P}. 

(Note that the definition of x depends on P.) 


Lemma 9.2.6 If M is a countable transitive model of ZFC, P G M is a 
partially ordered set, and G is an M-generic filter over P then for every 
x G M 


x G M p and val G (x) = x. 


In particular, M C M[G\. 


Proof It is an easy inductive argument. First prove by induction on 
rank(x) that a: is a P-name. Then notice that the definition of x is absolute 
and x G M. So x G M. Finally, another induction on rank(x) shows that 
vale (a;) = x. The details are left as an exercise. □ 

To see that G G M\G\ let T = {(p,p ) : p G P}. It is clearly a P-name 
defined in M from P. So T G M p . Moreover, 


val G (r) = {val G (p) : 3p G G «p,p) G T)} = {p G P: p G G} = G. 

So G G M[G}. 

This finishes the proof of Theorem 9.2.2. □ 

The last important basic problem we have to face is how to check that a 
given property <p is true in M[G]. For this, we will introduce the following 
definition. For a formula <p(x i,... ,x n ), a countable transitive model M 
of ZFC, a partial order P G M, and P-names t\, . . . , r n G M we say that 
p G P forces ip(r i, ■ ■ ■ , r n ) and write 


P \\~p,m <p{ri , ... ,T n ) 

provided </?(val G (ri), . . . ,val G (r„)) is true in M[G] for every M-generic 
filter G in P that contains p. We will usually just write ||— in place of 
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||— r,M when IP and M are fixed. We will also write P ||— . . . , r n ) if 

p ||— <p(ri, • • • , T n ) for every p £ P. Notice that 

if p ||- ip(r 1 , ... , T n ) and q < p then q ||- <p(n,. . . , r„), (9.4) 


since every M-generic filter G that contains q contains also p. 

Clearly, the definition of the relation ||— depends on a model M and 
a set P and involves the knowledge of all M-generic filters in P. Thus, it 
is clearly defined outside M. One of the most important facts concerning 
forcing is that an equivalent form of ||— can also be found inside M, in a 
sense described in the following theorem. 

Theorem 9.2.7 For every formula Lp{x i, . . . ,x n ) of set theory there exists 
another formula P,x\, .. . ,x n ), denoted by it \\—* P <p(x\,... ,x n ), such 
that for every countable transitive model M of ZFC, partial order P € M, 
and P -names t\ , . . . , r n £ M 


P || p,m ¥>(n, ■■■ ,t„) M \= {p ||-p ip(n, . . . ,r n )) (9.5) 

for every p £ P. Moreover, 

M[G] |= </?(val G (Ti), . . . ,val G (r n )) <t^> 3p £ G(p ||- tp(n, . . . ,t„)) (9.6) 

for every M-generic filter G in P. 

This theorem will be left without proof. 

The relation ||— * from Theorem 9.2.7 will justify the use of all forcing 
arguments within a model M and will let us conclude all essential properties 
of M[G\ without knowledge of G. We will often write ||— where formally 
the relation ||— * should be used. 

This completes all general details concerning the forcing method. How- 
ever, before we give a proof of Con (ZFC + ~^C Ft) we still need some techni- 
cal lemmas. The first of them tells us that a P-name representing a function 
with domain and range belonging to a ground model can be chosen to have 
a particularly nice form. 

Lemma 9.2.8 Let M be a countable transitive model of ZFC and P £ M 
be a partially ordered set. If X,Y £ M, pg £ P, and a £ M v are such that 
Po ||— “<t is a function from X into Y” then there exists a r £ M p such 
that 

(a) t C {((x,y),p): x £ X & y £Y & p G P & p < p 0 }; 

(b) po || — cr = t-; and 

(c) A x = {p £ P: 3y £ Y (((x,y),p) £ r)} is an antichain for every 
x £ X. 
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Proof For x G X let 

D x = {p G P: p < po k By G Y (p ||- (x,y) G a)}. 

Notice that D x G M since, by condition (9.5) of Theorem 9.2.7, 

D x = {p G P: p <p 0 k 3y G Y (p ||-* (x,y) G a) M }. 

Let A x be a maximal subset of D x of pairwise-incompatible elements. Thus 
every A x is an antichain. Define 

t = {((x,y),p): x G X k y GY k p G A x k p ||- (x,y) G a}. 

Once again r G M follows from (9.5), since 

r = {((x, y),p)'- x G X k y G Y k p G A x k (p ||— * (x, y) G a) M } 

and the family {A x \ x G A'} can be chosen from M. 

Clearly r is a P-name satisfying (a). 

To see (c) it is enough to notice that A x = A x for every x G X. This is 
the case since 

pGA x 3 y GY ((x,y),p) G r 

^ 3y G Y (p G A x k p ||— (x, y) G cr) 
p G A x . 

To prove that Po \\~ & = t, let G be an M-generic filter in P such that 
Po € G. Then g = vale (o’) is a function from X into Y, since po forces it. 
Also, clearly val^r) C X x Y. So let a; G A and y G Y. It is enough to 
show that (x,y) G val g(< j ) if and only if (x,y) G valc(T). 

But if (x, y) G valc(r) then there exists a p G G such that ((x, y),p) G r. 
In particular, p ||— (x,y) G a. But p G G. Therefore (x,y) G valuer). 

To prove the other implication first note that the set 

E x = {q < po : 3p G A x (q < p)} UjgGP: q is incompatible with po} 

is dense in P. To see it, let s G P. We have to find q G E x with q < s. 

If s is incompatible with po then q = s G E x . So assume that s is 
compatible with po and choose pi G P extending s and po- Let H be an 
-M-generic filter in P containing p-| . Then h = val// (a) is a function from X 
into Y since the condition p 0 G H forces it. Let y = h(x). Then, by (9.6), 
there exists an r G H forcing it, that is, such that r ||— (x,y) G er. Choose 
P 2 G H extending r G H and pi G H. Then, by (9.4), p 2 ||— {x,y) G a. In 
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particular, p 2 £ D x . But A x is a maximal antichain in D x . So there exists 
a Q < P 2 extending some p from A x . Hence q < P 2 < Po, so q £ E x , and 
q < P 2 < Pi < s. The density of E x has been proved. 

Now let (a;, y) € val g(«t). Then, by (9.6), there exists an r € G that 
forces it, that is, such that r ||— (x,y) £ cr. Moreover, there exists a 
q £ E x fl G, since G is M- generic and E x £ M is dense. But q £ G is 
compatible with po £ G. So there exists a p £ A x C D x with q < p. Thus, 
by the definition of D x , there exists a t/i £ Y such that p ||— (x, yf) £ a. 
In particular, ((x,yi),p) £ r and (x, yi) £ valc(T), since p £ G. To finish 
the proof it is enough to notice that y± = y. So let s £ G be a common 
extension of r, p, and po- Then, by (9.4), 

s ||— x,y ) £ a & (x,yi) £ a & a is a function^. 

Therefore, if H is an M -generic filter in P containing s then h = val h(&) 
is a function and y = h(x) = yi- □ 


Corollary 9.2.9 Let M be a countable transitive model of ZFC and P £ M 
be a partially ordered set that is ccc in M . Let G be an Ad-generic filter in 
P, and let g £ Ad[G\ be such that g: X — » Y and X, Y £ M. Then there 
exists a function F £ M such that 

Ad \= F: X ^ [Y}^ M 

and g(x) £ F{x) for all x £ X. 

Proof Let a £ Ad be a P-name such that vale (o’) = g. Since 

Ad[G\ 1= g is a function from X into Y, 

by Theorem 9. 2. 7(9. 6) there exists a po £ G such that 

Po |(— a is a function from X into Y. 

Let r £ Ad be a P-name from Lemma 9.2.8 and define 

F(x) = {y £ Y: 3p v £ P (((x,y),p v ) £ r)}. 

Then clearly F £ Ad and Ad \ = F: X — » V(Y). To see that for every 
x £ X the set F(x) is countable in M note that the map g: F(x) — > A x , 
g{y) = p y , is one-to-one. 

Indeed, if y,z £ F(x) and p v — p z then p = p y < po and so 
p ||— ^(x, y) £ t & (x, z) £ t & r is a function j . 
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Thus y = z. In particular, 

M\=\F(x)\ < 14,1 <w, 

as P is ccc in M. Finally, since g = val( 5 (cr) = valc(T) 

y = g(x) ^3 p£G [((x,y),p) erj=>i/e F{x). 

Corollary 9 . 2.9 has been proved. □ 

Next we will show that ccc forcings are very nice with respect to the 
cardinal numbers. To formulate this more precisely we need a few defini- 
tions. 

Let M be a countable transitive model of ZFC, P £ M be a partial order, 
and ft G M be an ordinal number such that M |= “ft is a cardinal number.” 
If M[G } \= “ft is a cardinal number” for every M -generic filter G in P then 
we say that forcing P preserves the cardinal k. On the other hand, if 
M[G\ \= “ft is not a cardinal number” for every M- generic filter G in P 
then we say that forcing P collapses the cardinal ft. If P preserves all 
cardinal numbers from M then we simply say that P preserves cardinals. 

Lemma 9.2.10 Let M he a countable transitive model of ZFC and P £ M 
he a partial order that preserves cardinals. If G is an M -generic in P then 
< = for every ordinal number a £ M . 

Proof This follows by an easy induction; it is left as an exercise. □ 

Theorem 9.2.11 Let M he a countable transitive model of ZFC. If P 6 M 
is a partial order such that M 4 “P is ccc” then P preserves cardinals. 

Proof Let M and P be as in the theorem and assume, to obtain a contra- 
diction, that there is an ordinal number ft and an M -generic filter G in P 
such that k is a cardinal in M but is not in M[G\. By the absoluteness of 
finite cardinals and ui it must be that n > u>. 

Since ft is still an ordinal in M[G\ it means that it is not initial, that 
is, there exist an ordinal a < ft and a function g £ M[G ] such that g 
maps a onto ft. So, by Corollary 9 . 2 . 9 , there exists a function F £ M 
such that M \= “F: a — > [«]-“” and g(f) £ F(f) for all £ £ a. Hence 
K = g[a] c Uj e « F (0- 

But U^gq; -^(0 £ since F G M. So the following holds in M: 


k < 


U F K) 

£Go: 


< |a| <g> u> < k, 
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since all sets F(£) are countable in M. This contradiction finishes the 
proof. □ 

To apply this theorem to the proof of the consistency of ZFC+^CH we 
need one more fact (compare Exercise 1 from Section 8.2). 

Lemma 9.2.12 Forcing P = (Func w (X, Y), D) is ccc for \Y\ < u>. 

Proof Let (p^: t; < uf) be a one-to-one sequence of elements of P. We 
have to find ( < £ < uj\ such that p ^ and p j are compatible. 

Consider the family A = {clom(p^) : £ < uq} of finite sets. Notice that 
A must be uncountable, since for every A £ A there is at most \Y A \ < co 
p^ s with A = dom(p^). So, by the A-system lemma (Lemma 8.2.11), there 
exist an uncountable set S C uq and a finite A with the property that 
dom(p^) ndom(pj) = A for any distinct But the set Y A is at most 

countable. So there exist distinct ( and £ from S such that P(\a = Ps]a- 
This implies that p = p^ U p^ is a function, that is, p £ P. Since p clearly 
extends p f and p ^ we conclude that p ^ and p ^ are compatible. □ 

Theorem 9.2.13 Theory ZFC+~<CH is consistent. 

Proof We will follow the path described in Example 9.1. Let M be a 
countable transitive model of ZFC. Define (in M) 

P = (Func w (w 2 f xw,2), D) 

and let G be an Ai-generic filter in P. We will show that c > u>i in M[G\. 
For x £ x u) let 


D x = {p £ P: x € dom(p)}. 

It is easy to see that all sets D x belong to M since they are defined using 
a:,P £ M. They are dense, since for every p £ P either x £ dom(p) 
and p £ D x , or x ^ dom(p) and pU{(i,0)} £ D x extends p. Thus, 
by Proposition 8.1.1, g = (J G is a function from uif f x u to 2. Clearly 
g £ M[G], since it is constructed from G € M[G\. 

Similarly as for D x we argue that for every ( < £ < the sets 

= {p £ P: 3n < u [(C ,n),(£,ri) £ dom(p) & p((,n) ^ p(£,n)]} 

are dense in P and belong to M . Thus an M- generic filter intersects every 
such set, that is, for every ( < £ < u there exists an n < ui such that 
g((,n) yf g(£,n). 

Now for every £ < we can define (in A I[G}) a function g^: u) — » 2 
by ff{(n) = g(£,n). The set { g f belongs to M[G\, since it has 
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been constructed only from g £ M[G\. Moreover, { g £ < lo) 1 } C 2“ and 
all the functions g ^ are different. So 

M[G ] |=c = |2“| >wf. 

But, by Lemma 9.2.12, forcing IP is ccc in M and so, by Theorem 9.2.7, it 
preserves cardinals. In particular, by Lemma 9.2.10, that is, 

M[G \ h c > wf [G] > w i. 


This finishes the proof. □ 

A model M[G\ obtained as a generic extension of a ground model M via 
the forcing IP = (Func w (A, 2), d) (with X £ M) is called a Cohen model. 
The numbers g j from Theorem 9.2.13 are called Cohen (real) numbers. 

EXERCISES 

1 Prove that x £ R{oj) if and only if | trcl(cc) | < oj. Use this to show that 
M = R(co) is a model for ZFC minus the infinity axiom. 

2 Let V’( r ) be a formula expressing the property that “r is a P-name” and 
let M be a CTM of ZFC with P £ M. Show, by induction on rank(r), that 
for every r £ M 

ip(r) if and only if tp M (t). 

Thus the property “r is a P-name” is absolute. 

3 Complete the details of the proof of Lemma 9.2.6. 

4 Let M be a countable transitive model of ZFC, P £ M be a partial 
order, and G be an M- generic filter in P. Show that < uia^ for every 
ordinal number a £ M. 

5 Prove Lemma 9.2.10. 


9.3 Model for CH and <0 

The main goal of this section is to prove the following theorem. 

Theorem 9.3.1 Let M be a countable transitive model of ZFC, and let 
P = (Func Wl (u>i,2)) M . If G is an M-generic filter in P then to^ 1 = 

V(io) ft M = V(u>) D M[G\, and f) holds in M[G\. 

Since implies CH the theorem immediately implies the following 
corollary. 
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Corollary 9.3.2 Theories ZFC+CH and ZFC +{ ) are consistent. 

Combining this with Theorem 9.2.13 we immediately obtain the follow- 
ing. 

Corollary 9.3.3 The continuum hypothesis is independent of ZFC. 

Note that P = Func Wl (cui, 2) stands for the set of all functions p : A — > 2, 
where A is an at most countable subset of u>i. Since the terms “wi” and 
“being countable” are not absolute, the superscript M in the definition of 
P in Theorem 9.3.1 is essential. 

We will prove Theorem 9.3.1 with the help of several general facts that 
are, in most cases, as important as the theorem itself. 

Let P be a partially ordered set and let p £ P. A set D C P is dense 
below p if for every q < p there exists a d £ D such that d < q. 

Proposition 9.3.4 Let M be a countable transitive model of ZFC, P € M 
be a partially ordered set, and p £ P. If D £ M is a subset of P that is 
dense below p then G D D ^ 0 for every M -generic filter G in P such that 
P eG. 

Proof Let if = {r £ P: r is incompatible with p}. Then Dq = DC R £ M 
and Dq is dense in P. So if G is an M-generic filter in P then G D Dq ^ 0. 
But if p € G then G H i? = 0, so G H D (/). □ 

A partially ordered set P is countably closed if for every decreasing 
sequence ( p n £ P : n < oj) there is a £ P such that < p n for every 
n < ui. Notice that P* = Func^ (uq , 2) is countably closed since p w = 
U,ko ,Pn G P extends every p n . So P = (Func Wl (uq, 2)) M is countably 
closed in M. 

Theorem 9.3.5 Let M be a countable transitive model of ZFC and P £ M 
be a partially ordered set that is countably closed in M. Let A,B G M be 
such that A is countable in M and let G be an M-generic filter in P. If 
f £ M[G\ is a function from A into B then f £ M. 

Proof Let r £ M be a P-name such that valc(T) = /. Then, by Theo- 
rem 9.2.7, there exists a p £ G such that 

p ||— r is a function from A into B. 


Define in M 


D = {q£P:3g£B A (, q ||- r = g)}. 

( D belongs to M since the relation ||— can be replaced by the formula ||— *.) 
By Proposition 9.3.4 it is enough to prove that D is dense below p since 
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then there exists a q € G fl D, that is, there is a g G M such that g maps 
A to B and 

/ = val G (r) = val G {g) = g G M. 

To prove that D is dense below p, fix r G P such that r < p. We have 
to find a q £ D with q < r. Let { a n : n < u>} be an enumeration of A in 
M. Define, in M, sequences (p n G P: n < to) and (b n G B: n < to) such 
that for all n < u 

(1) p 0 = r, 

( 2 ) Pn + 1 ^ Pm 

(3) p n+ 1 ||- r(a n ) = b n . 

To see that such a construction can be made assume that pi for i < n 
and bj for j < n have already been constructed. It is enough to show that 
there exist p n +i and b n satisfying (2) and (3). 

The proof of the existence of such p n +\ and b n will be done outside M. 
For this, let H be an M-generic filter in P with p n G H. Then h = val h(t) 
is a function in M[H\ from A into B, since p n < p forces it. Thus there is 
a b G B such that h(a n ) = b and we can find an r G H which forces it, that 
is, such that r ||— r(a„) = b. Refining r, if necessary, we can also assume 
that r < p n . Then p n+ 1 = r and b n = b satisfy (2) and (3). 

Now let p u G P be such that p w < p n for every n < ui and let g: A — > B 
be such that g(a n ) = b n . Then g G M and 

Plu ||- T («n) = K 

for every n < w. To finish the proof it is enough to show that 

Pco ||- T = 9 

since then q = p u < r and belongs to D. 

So let H be an arbitrary M-generic filter in P with p w G H . Then 
h = valfl-(r) is a function in M[H] from A into B. Moreover, h(a n ) = b n 
for every n < w since p u < p n forces it. In particular, 

M[H] b h = val h (t) = g 


that is, p u ||— r = g. □ 

Theorem 9.3.5 tells us that in a generic extension of a model M obtained 
from a countably closed forcing the extension will have the same countable 
sequences with elements from a fixed set from M . This implies the following 
corollary. 
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Corollary 9.3.6 Let M be a countable transitive model of ZFC and P £ M 
be a partially ordered set that is countably closed in M. If G is an M- 
generic filter in P then V(u>) D M = V(u>) fl M[G\ and uf 1 = uf 7 ^ . 

Proof Let (3 = uif 1 . If a < (3 then a is an ordinal number in M and in 
M[G\. Moreover, if a < (3 then there exists an / £ M C M[G\ such that / 
maps u> onto a. Thus every a < (3 is at most countable (in M and in M[G}). 
In particular, it is enough to show that (3 is uncountable in M[G\, since 
then it is the first uncountable ordinal in M[G], that is, = (3 = wf 7 ^. 

To see that f3 is uncountable in M[G\ , take / : u> — > (3 from M[G\ and 
notice that, by Theorem 9.3.5, / € M. So / cannot be onto (3 = uif 1 . Thus 
/ 3 is uncountable in M[G}. 

To see that V(u>) fl M = V(u>) fl M[G] it is enough to prove that 
V(u>) n M[G]c P(w)nM. So let A £ V(uj)r\M[G\ and let / = Xa '■ u> — > 2. 
Then / £ M[G } and, by Theorem 9.3.5, f £ M. Therefore A = / -1 ( 1) £ 
V{u) D M. □ 

Notice that Corollary 9.3.6 implies the statements urf 1 = and 

V(io) fl M = V(u>) fl M[G] of Theorem 9.3.1 since the forcing P from the 
theorem is countably closed in M. Notice also that Corollary 9.3.6 does 
not imply that countably closed forcings preserve all cardinal numbers. In 
fact, by proving Theorem 9.3.1 we will show that in M[G] there exists a 
bijection between tvf 1 = wf 7 ^ and V m (uj) = V M ^ G \u>), that is, there is 
a bijection between cof 1 and c M . Thus, if ujf r < c M (which can happen 
by a result from the previous section) then the cardinal c M is collapsed in 
M[G\. 

To prove that <C> holds in the model M\G\ from Theorem 9.3.1 the 
following lemma will be useful. 

Lemma 9.3.7 Let M be a countable transitive model of ZFC and let P, Pi £ 
M be partially ordered sets such that there is an order isomorphism f £ M 
between P and Pi. If G is an M-generic filter in P then G\ = f[G] is an 
M -generic filter in Pi and M[G] = M[Gi\. 

Proof It is easy to see that G\ = f[G] is a filter in Pi. To see that Gi 
is M-generic let D\ £ M be a dense subset of Pi. Then D = f~ 1 (D i) 
belongs to M and is dense in P. So D fl G ^ 0 and 

Di n Gi = f[D\ n f[G] = f[D n G] yf 0. 

To see that M[G\ = M[G\\ notice that M c M[G\ and M c M[Gi]. 
Moreover, Gi = f[G] £ M[G}\ thus, by the minimality of M[Gi], we con- 
clude M[Gi] C M[G\. Similarly, G = / _1 (Gi) £ Af[Gi]. Thus M[G } C 
M[Gi], □ 
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Lemma 9.3.7 tells us that if we replace a forcing by its isomorphic copy 
then the appropriate generic extensions will be identical. In particular, if 
X G M is such that M |= |X| = oj\ then forcings P = (Func Wl (oq, 2)) M and 
Pi = (Func Wl (X, 2 )) m are isomorphic in M (by an isomorphism F : P — » Pi 
with F(p)(x ) = where /: X — > uf 1 is a bijection with / G M). 

Before we prove that <C> holds in the model M[G\ from Theorem 9.3.1 
we will show that M[G\ \= CH. This is not used in the proof of M[G] \= 
However, the proof of CH in M[G\ is simpler than the one for <C>, and it 
will prepare the reader for the later one. Moreover, the reader interested 
only in the consistency of ZFC+CH can skip the latter part of this section. 

By Lemma 9.3.7 we can replace P with Pi = (Func wi (uq x u>, 2)) M . Let 
Gi be an M-generic filter in Pi. Thus M[G\ = M[G\]. Clearly g = (J G± 
belongs to M[G] and, by Proposition 8.1.1, it is a function from tof 1 x u> 
to 2. Notice that for every / G D M the set 

Df = {p Pi : 3a < wf 1 [{a} x lu C dom(p) & V?i < u) ( p(a , n) = /(n))]} 

belongs to M and is dense in Pi. (To see its denseness, choose p G Pi, 
find a < cof 1 with ({a} x to) fl dom(p) = 0, and notice that the condition 
p U {((a, n), f(n)) : n < ui} G Df extends p.) So if f a : to — > 2 for a < uf 1 
is defined by f a (n) = g(a,n) then, by Theorem 9.3.5, 

2 W n M C {f a : a < wf } C2“n M[G } =2“n M. 

Thus, in M[G\, 

= = | W M| = || /q . a < W M}| = |2« n M[G]| = C M[G] . 

So CH holds in M[G]. 

Proof of Theorem 9.3.1 The proof is similar to that of Theorem 9.3.5. 

Let X = {((,£,)'■ C < and replace P = (Func Ul (wi, 2)) M 

with P 2 = (Func Wl (X, 2)) m . We can do this by Lemma 9.3.7 since these 
forcings are isomorphic in M. Let G be an M-generic filter in P 2 and let 
g = (J G G M[G\. Then g is a function from X to 2. For a < let 
g a : a -> 2 be defined by g a (() = g{(,a) and put A a = g~ l {l). It is easy 
to see that the sequence {A a : a < to^ 1 ) belongs to M[G\. We will show 
that it is a ^-sequence in M[G\. 

So let A G M[G\ be such that A C = uf 1 and let / = Xa ■ We 

have to show that the set 

5={a<wf:ina = A a } = {a < ivf 1 : f\ a = g a } 

is stationary in M[G] (remember that = uf 1 ). For this, let C C ujf 1 , 

C G M[G\, be such that 

M[G) \= C is a closed unbounded subset of to\. 
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We will show that S fl G b 0, that is, that there exists an a G G such that 

f\a = 9a- 


Let t, a £ M be P 2 -names such that valqb) = / and vale (o') = G. 
Then there exists a p £ G such that 

p ||— (a is a closed unbounded subset of aq & t: uq — > 2). 

Define in M the set D by 

[d e P 2 : 3a < (q ||- (a G a & VC < a (r(C) = ([J r ) (C, ")))) } > 

where T is the standard P 2 -name for a generic filter. It is enough to prove 
that D is dense below p since then there exists a q £ G fl D and so there is 
an a £ valuer) = C such that 


/(C) = 5(C,a) for all f < a, 


that is, f\ a = g a . 

To prove that D is dense below p fix an r£P such that r < p. We have 
to find a q £ D with q < r. 

Define in M the sequences (p n € P 2 : n < u>), {/ 3 n < uf 1 : n < to), 
(S n < : n < ui), and (b n £ 2 ^ n : n < to) such that for every n < to 

(1) Po = r and 6 0 = 0; 

(2) dom(p„) C {(C, 0 ■■ C < ? < Pn}', 

(3) Pn < Pn- 1 if n > 0; 

(4) (3 n - 1 < Sn < Pn if n > 0; 

(5) Pn lb € cr & T \p n _ x = bn'j if U > 0. 

To see that such a construction can be made assume that for some n < u> 
the sequences are defined for al \i < n. If n = 0 then po and <5o are already 
constructed and and b 0 can be easily chosen, since the conditions (3), 
(4), and (5) do not concern this case. So assume that n > 0. We have to 
show that the inductive step can be made, that is, that appropriate p n , P n , 
8 n , and b n exist. 

So let Go be an M-generic filter in P 2 with p„_i € Go- Then f 0 = 
valc 0 (t) € M[Go] is a function from t vf 1 to 2 and Co = valG 0 (cr) is a 
closed and unbounded subset of uq in M[G 0 ], since p n -\ < p forces it. In 
particular, there exists a d £ Co with d > P n -i- Also, b = h\p n _ 1 £ M, 
since P 2 is countably closed. So there exists a q £ Go, with q < p n -i, that 
forces it. Then p n = q, b n = b, and 8 n = d satisfy (3), (5), and the first part 
of (4). The choice of /3 n satisfying (2) and (4) finishes the construction. 
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Now let = U„<o ,Vn G IF* 2 - Then for all n < u> 

Pu ||- (X+l G cr & T \p n = &n+l)- 

Thus the functions b n + 1 must be compatible, and b = {J n<ul b n + \ G M is a 
function from a = [j n<UJ f3 n = U„<u, into 2. 

Put q — p u U {((f,a),b(Q): C < a}. Notice that q is a function since, 
by (2), dom(p w ) is disjoint from {(£, a): £ < a). Since q < r it is enough 
to show that q belongs to D. 

So let G\ be an M - generic filter in P 2 containing q and let C\ = val^ (<r) 
and /i = valc^r). It is enough to show that a G C\ and ,/i(C) = 
(U r) (C> a) for every C, < a. But C\ is a closed unbounded subset of 
and 5 n +i G C± for every n < u>, since q forces it. Thus a = (J n<UJ 6 n G C\. 
Also, for every ( < a there exists an n < u) such that C, < (3 n . So 

/i(0 = va lG 1 (r)|/3 n (C) = b n+ i(C) = 6(C) = q(C,a) = ((J r ) (C.«)- 

This finishes the proof. □ 

The proof just presented can be easily generalized to the following the- 
orem. Its proof is left as an exercise. 

Theorem 9.3.8 Let M be a countable transitive model of ZFC, n be a 
regular cardinal in M, and P = (Func K +(« + x k, 2)) m . Then forcing P 
preserves all cardinals < k + and 2 K = n + holds in M[G\, where G is an 
M -generic filter in P. 

Theorem 9.3.8 implies immediately the following corollary, which will 
be used in Section 9.5. 

Corollary 9.3.9 It is relatively consistent with ZFC that 2 Ul = u > 2 . 

EXERCISES 

1 Complete the details of Lemma 9.3.7 by showing that for every isomor- 
phism / between partially ordered sets P and Pi and every G, D C P 

(1) G is a filter in P if and only if f[G] is a filter in Pj_; 

(2) D is dense in P if and only if f[D\ is dense in Pi. 

2 Show that the functions 6 ra +i from the proof of Theorem 9.3.1 are indeed 
compatible and that b = U n<w 6 ra +i is a function from a to 2. 



9. 4 Product lemma and Cohen model 


189 


3 Prove Theorem 9.3.8 in the following steps. 

(a) Generalize the proof of Theorem 9.3.5 to show that if f £ M[G\ is a 
function from k into A £ M then f £ M. 

(b) Use (a) and the ideas from the proof of Corollary 9.3.6 to show that 
P preserves cardinals < k + and V(k) D M = V(k) fl M[G\. 

(c) Show that the function /: n + — » 2 K , f(a)(/3) = (U G)(a, /?), is 
onto 2 K . 

9.4 Product lemma and Cohen model 

In the next section we will prove the consistency of MA+- 1 CH. The method 
used in its proof is called iterated forcing. The idea behind this method is 
to repeat the forcing extension process recursively, that is, to construct by 
transfinite induction of some length a a sequence 

M = M 0 C Mi c • • • C M ? c • • • C M a 

of countable transitive models of ZFC such that = M^[G^\ for every 

£ < a, where G % is an M %- generic filter in some forcing P^ £ 

This description is very specific and easy to handle at successor stages. 
The limit stage, however, presents a problem. For a limit ordinal A < a we 
cannot simply take M\ = U^<a since such an M\ does not have to be 
a model of ZFC. Intuitively, we can avoid this problem by defining M\ as 
a generic extension of M by a sequence (Gj : £ < A) . But in what forcing 
should the sequence (G % : £ < A) be a generic filter? 

To solve this problem, we will find one forcing in the ground model that 
will encode all forcings used in our intuitive inductive construction. In this 
project, however, we will find that the successor step creates more trouble 
than the limit one. In fact, at the moment we don’t even know whether the 
model M 2 = M[G 0 \ [Gi] obtained by two consecutive generic extensions can 
also be obtained by a single generic extension, that is, whether M 2 = M[G] 
for some P £ M and an M- generic filter G in P. 

In this section we will concentrate on this problem only in the easier 
case where both partially ordered sets Po and Pi leading to the extensions 
Mi = M[G\\ and M 2 = Mi[Gi], respectively, belong to the ground model 
M. The general iteration, including that of two-stage iteration, will be 
described in the next section. 

In the simple case when Po,Pi £ M the forcing P is just a product of 
forcings (Po, <o) and (Pi, <i), that is, P = P 0 x Pi and 


(po,pi) < (<io,qi) o Po <o qo & p\ <i qi- 
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The relation between the forcing extensions obtained by these partially 
ordered sets is described in the next theorem. We will start, however, with 
the following simple fact. 

Proposition 9.4.1 Let P be a product of the forcings Po and Pi. Then G 
is a filter in P if and only if G = Go x G\ for some filters Go and G \ in 
Po and Pi , respectively. 

Proof First, assume that G is a filter in P. Let Gq and G\ be the pro- 
jections of G onto the first and the second coordinates, respectively. Then 
GcGqX G\. 

To see the other inclusion take p 0 G Go and q\ £ G\. Then there exist 
qo £ Gq and pi £ G \ such that (po,pi), ( go , <7i) G G. Let (ro, ri) G G be an 
extension of (p 0 ,Pi) and (q 0 ,qi). Then (r 0 ,ri) < (p 0 ,qi), so (p 0 ,qi) G G. 

The proof that Go and Gi are filters is left as an exercise. It is also 
easy to see that G = Go x Gi is a filter, provided Go and Gi are. □ 

Proposition 9.4.1 tells us, in particular, that any filter in Po x Pi is a 
product of filters in Po and in Pi. This gives a better perspective on the 
next theorem. 

Theorem 9.4.2 (Product lemma) Let M be a countable transitive model 
of ZFC, let P 0 ,Pi G M be partially ordered sets, and let Go C Po and 
G i C Pi be filters. Then the following conditions are equivalent. 

(i) Go x Gi is an M -generic filter in Po x Pi. 

(ii) Go is an M -generic filter in Po and G i is an M[Go\-generic filter 
in Pi. 

(iii) Gi is an M-generic filter in Pi and Gq is an M[G\\-generic filter 
in P 0 . 

Moreover, if any of these conditions hold then M[Gq x Gi] = M[Go][Gi] = 
M[Gi][G 0 ], 

Proof Note first that the natural isomorphism between Po x Pi and Pi x Po 
maps Go x Gi onto Gi x Go- Thus, by Lemma 9.3.7, condition (i) is 
equivalent to the condition 

(i') Gi x Go is an M-generic filter in Pi x Po- 

But the equivalence (i , )<t4(iii) is obtained from (i)4=>(ii) by exchanging in- 
dices. So all equivalences follow from (i)<£=>(ii). 
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To prove (i)=>(ii), assume (i). To see that Go is M-generic in P 0 take a 
dense subset Do £ M of Po- Then Do x Pi £ M is dense in Po x Pi. Thus, 
by (i), (D 0 x Pi) fl (Go x Gi) b 0, and we conclude that Z? 0 fl Go b 0. 

To see that Gi is an M[Go]-generic filter in Pi take a dense subset 
Di £ M[G 0 ] of Pi. Let r £ M be a P 0 -name such that Di = val< 3 0 (r) and 
let po £ Go be such that 


Po || — t is dense in Pi. 


Choose pi e Gi, define 

D = {(go, qi) ■ go < Po & go ||- gi G r} <E M, 

and note that D is dense below (po,Pi)- 

Indeed, let (ro,ri) < (po,Pi) and let G' 0 be an M-generic filter in P 0 
containing ro. Then D( = val g' 0 ( t ) is dense in Pi since ro < po, and po 
forces it. Take gi < n with gi £ D( and find a go £ G' 0 that forces it: 

go ||- gi g t. 

Since go and ro belong to the same filter G' 0 they are compatible, and 
taking their common extension, if necessary, we can assume that go < ro- 
But then (go,gi) < (ro,ri) and (go,gi) G D. Thus D is dense below 
(pihPi) £ Go x G\. 

So we can find (g 0 , gi) £ D fl (Go x Gi). But go G Go and g 0 ||— gi G r. 
Thus gi £ valc 0 (t) = D lt so we have found qi £ Did G\. 

To prove that (ii) implies (i) take a dense subset D of P 0 x Pj that 
belongs to M. We will show that D fl (G 0 x Gi) / 0. 

For this, define 

Dx = {pi £ Pi : 3p 0 £ Go (ipo,Pi) G D)} £ M[G 0 \ 

and note that D\ is dense in Pi. Indeed, if r\ £ Pi then the set 

D 0 = {po G P 0 : 3pi < n ((po,Pi) G D)} £ M 

is dense in P 0 and so there exists apo G Do d Go- Thus there exists a 
Pi < ri with (po,pi) £ D and pi £ D\. 

Now, by genericity, there exists api £ G\d D\. So, by the definition 
of D i, we can find a po G Go with (po-Pi) G D. But then (po-Pi) G 
D fl (G 0 x Gi), that is, D fl (Go x Gi) b 0- 

Finally, if conditions (i)-(iii) hold then the equation M[Go x Gi] = 
M[Go][Gi] holds by the minimality of different generic extensions. 

To see that M[Go x Gi] C M[Go][Gi] note that Go G M[Go] C 
M[Gq] [Gi] and Gi G Af[G 0 ][Gi]. So G 0 x Gi G Af[G 0 ][Gi]. Moreover, 
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M C M[G i] C M[Go][Gi]. Thus, by the minimality of M[Gq x Gi], the 
inclusion holds. 

To see the reverse inclusion note that M C M[Gq x Gi] and Go G 
M[GoxGi]. So, by the minimality of M [G 0 ], we have M[G 0 ] C M[GqxG\\. 
But we have also Gi G M[Gq x Gi]. Thus M[Go][Gi] C M[Go x Gi] by 
the minimality of M[Go][Gi]. 

The equation M[Go x Gi] = M[Gi][Go] is proved similarly. □ 

Next, we will see an application of Theorem 9.4.2 to the Cohen model, 
that is, the model from Section 9.2. 

First note that if P (Z) = (Func U (Z, 2), D) and { A , B} is a partition of 
Z then P (Z) is isomorphic to P(A) x P(B) via the isomorphism i : P (Z) — » 
P(A) x P(B), i(s) = (s|a)S|b)- Also, if G is a filter in P (Z) and G C Z 
then 

Gc = {s|c ; s € G} = G n P(G) 

and i[G\ =G A xG B = (Gfl P(A)) x (G D P(B)). 

Now, if M is a countable transitive model of ZFC, Z,A,B £ M are 
such that {A, B} is a partition of Z , and G is an M-generic filter in P (Z) 
then, by Theorem 9.4.2, Ga = Gf 1 P(A) is an M-generic filter in P(A), 
Gb = GnP(B) is an M[Ga\- generic filter in P(B), and 

M[G] = M[G a x Gb] = M[G A ][G B ]. 

This fact and the next lemma are powerful tools for proving different facts 
about the Cohen model. 

Lemma 9.4.3 Let M be a countable transitive model of ZFC, X,Y,Z G 
M, and G an M-generic filter in P (Z) . If f: X —*Y is in M[G\ then there 
exists an A G V M (Z) such that 

M ]= \A\ < \X\ +w 


and f G M[G n P(A)]. 

Proof By Lemma 9.2.8 we can find a P(Z)-name r G M such that / = 
valcfr), t C {((a;, y),p) : x G X & y G F & p G P (Z)}, and 

(*) A x = {p G P (Z): 3y £ Y (((x,y),p) £ r)} is an antichain for every 
x £ X. 

So every A x is countable in M, since P (Z) is ccc in M. 

Let 

A= (J (J dom(p). 

xGX p£A x 
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Then A £ M, A C Z, and |A| < |X|+u; in M, since every set [J peA ^ dom(p) 
is countable in M. To finish the proof it is enough to show that / € 
M[G n P(A)]. 

But for every s = {(x,y),p) € r we have p £ A x C P(A). Thus 
T * = {((x,y) ,p): {{x,y),p) £t} £ M 
is a P(A)-name, where (x,y) is a standard P(A)-name for (x,y) £ M. So 


/ = val G (r) 


{(x,y): 3p £ G ({{x,y),p} £ r)} 

{(x,y): (3 p £ G nP(A))(((x,y) ,p) £ r*)} 
val Gn p(A)(' r *) € M[GnP(A)]. 


This finishes the proof. 


□ 


Since every real number is identified with a function r: u> — > 2, we 
obtain immediately the following corollary. 


Corollary 9.4.4 Let M be a countable transitive model of ZFC, Z £ M, 
and G be an M-generic filter in P (Z). If r £ R M [ G 1 then, in M, there 
exists a countable subset A of Z such that r £ M[G D P(A)]. 

Our next goal is to prove that Martin’s axiom is false in the Cohen 
model obtained by the forcing P (Z) with \Z\ > u>\. For this, first note that 
the formula “r is a real number,” which is identified with “r is a function 
from u) into 2,” is absolute. So for every countable transitive model M of 
ZFC we have 


K m = R flM and [0, 1] M = [0, 1] fl M. 

In particular, if M[G\ is any generic extension of M then 
[0,1] nM.ROM £ M[G ], 

since InM = I M eIc M[G\ and [0, 1] n M = [0, 1] M £ M c M[G}. 

Lemma 9.4.5 Let M be a countable transitive model of ZFC, let Z £ 
M be countable in M, and let G be an M-generic filter in P (Z). Then 
M fl [0,1] £ M[G\ has Lebesgue measure zero in M[G ]. 

Proof Since the forcings P (Z) and P(w) are isomorphic in M we can 
assume, by Lemma 9.3.7, that Z = u>. Also, we will identify numbers from 
[0, 1] with their binary expansions, that is, functions from u> into 2. 
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For s £ P(w) let [s] = {t £ [0, 1] : s C t}. Notice that the Lebesgue 
measure of [s] is equal to l{[s}) = 2~ n , where n = |s|. Now let r = |J G € 2 U 
and for k < to put 

s k = |J M [2 m ,2 m + 1 )] • 

m>k 

Notice that 

l(Sk) < E = E 2_2m < E 2_m = 2_fe - 

ra>/c m>/c m>k 

Thus the set S = Sk has measure zero. It is enough to prove that 

M fl [0, 1] C Sk for every k < to. 

So fix k < u) and t £ M fl [0, 1] and define 

D = {s € P(w): 3 to > k (s|[ 2 m 2^n+l^ — [2 m 2 m + 1 ) )} e M. 

It is easy to see that D is dense in P(w). Thus, GCiD ^ 0, that is, there is 
an m > k such that r |[ 2 >«, 2 m + 1 ) = t|[ 2 m , 2 m + 1 )- So t £ [r |[ 2 ”>, 2 m + 1 )] C Sk- □ 

Theorem 9.4.6 Let M be a countable transitive model of ZFC, Z £ M 
uncountable in M, and G an M -generic filter in P (Z). Then, in M[G\, the 
interval [0,1] is a union of ui\ sets of Lebesgue measure zero. 

Proof First assume that Z has cardinality u>i in M. Then the forcings 
P (Z) and P(wi x w) are isomorphic. So, by Lemma 9.3.7, we can assume 
that Z = cvi x lo. 

Now, for every £ < o>i, 

M[G\ = M[G n P ((wi \ {£}) xw)][GnP({(}xw)], 

so, by Lemma 9.4.5 used with M[GC \ P ((uq \ {£}) x w)] as a ground model, 
there exists a set £ M[G\ of Lebesgue measure zero such that 

M[G n P((wi \ {£}) X w)] n [o, l] c % 

But, by Corollary 9.4.4, for every r £ M[G] fl [0, 1] there exists in M a 
countable subset A of Z with r £ M[G fl P(A)]. Hence there exists a 
£ < u>\ such that A C (uji \ {£}) x to and 

r e M[G nP(.A)] cM[GnP((wi\{(}) xw)]. 

Therefore M[G\ fl [0, 1] C U^< Wl '%• This finishes the proof of the case 
when M \= \Z\ = u>i- 

For the general case, take Zo £ V M (Z) of cardinality u>i in M. Then 
M[G] = Mi[Gi], where Mi = M[GnP(Z \ Z 0 )] and G\ = GflP(Z 0 ) is an 
Mi-generic filter in P(Zo)- The conclusion of the theorem follows from the 
first case. □ 
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Corollary 9.4.7 Let M he a countable transitive model of ZFC and as- 
sume that Z £ M has cardinality at least u >2 in M . If G is an M-generic 
filter in P (Z) then, in M[G ], the interval [0,1] is a union of less than 
continuum many sets of Lebesgue measure zero. 

In particular, Martin’s axiom is false in M[G\. 

Proof Theorem 9.2.13 implies that c > u>i in M[G\. Hence, by Theo- 
rem 9.4.6, we can conclude the first part of the corollary. The second part 
follows from the first part and Theorem 8.2.7. □ 

We will finish this section with an estimation of the size of the contin- 
uum in the models obtained by ccc forcing extensions. 

Theorem 9.4.8 Let M be a countable transitive model of ZFC, P € M 
be an infinite ccc forcing in M, and G be an M-generic filter in P. If 
k = |P^| M then 

M[G] \= c < k. 

Proof Since, by Theorem 9.2.11, the models M and M[G] have the same 
cardinals, we will not distinguish between them. Let T be the family of all 
P-names r C {((n,i),p): n € u> & i £ 2 fcp£ P} such that 

(★) A n = {p £ P: 3i £ 2 ((( n,i),p ) £ r)} ^ 0 is an antichain for every 
n £ ui. 

Then T £ M and every A n is at most countable. 

By Lemma 9.2.8 for every f £ 2“ fl M[G\ we can find a r £ T such that 
/ = vale ( t]. In particular, the image val g[F] of T under vale contains 
2“ (~l M[G\. So c < \F\ in M[G\, and it is enough to prove that \F\ < k. 
We will show this in M. 

To see it, define in M a map ft,: (2 x P)“ XUJ — > F such that for every 
t £ (2 x P)“ xw if t(n,j) = ( i n ,j,Pn,j ) then 

h{t) = {((n,i n j),p nj ): n,j £ w}. 

Note that ft is onto T , since every set A n is at most countable. Thus 

1^1 < |(2 X P)“ x “| = K. 

This finishes the proof. □ 

Corollary 9.4.9 Let M be a countable transitive model of ZFC, k an in- 
finite cardinal in M, and G an M-generic filter in P(k). If k = |k w | M 
then 

M[G\ \= c = k. 

In particular, if CH holds in M and k = then 
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Proof First note that |P(/c)| = k, and sok = \k u \ M = |P(k)“| M . Thus, 
by Theorem 9.4.8, < n. The other inequality for k = u >2 follows from 

Theorem 9.2.13. The general case is left as an exercise. 

To see the additional part note that under CH the regularity of u >2 
implies 


/ (jj 

LO 2 ^ ^2 


IU‘ 


— 0 — ^2 0 ^ — ^2 0 C — UJ2 0 a, 2 — C02- 


So, under CH, k = 0 J 2 satisfies the assumptions of the main part of the 
corollary. □ 


EXERCISES 

1 Complete the proof of Proposition 9.4.1. 

2 Show that the set D from the proof of Lemma 9.4.5 is indeed dense 
in P(w). 

3 Complete the proof of Corollary 9.4.9 for arbitrary n by showing that 

c M ^ > «. 

4 Let M be a countable transitive model of ZFC, P £ M a partially ordered 
set, and G an M-generic filter in P. Prove that M[G } and M have the same 
ordinal numbers. 


9.5 Model for MA+^CH 

The goal of this section is to prove the consistency of MA+“c = u> 2 -” For 
this, we will follow the idea mentioned at the beginning of the previous sec- 
tion. We will start with a countable transitive model M of ZFC+“2 Wl =W2,” 
which exists by Corollary 9.3.9, and find its generic extension N via a ccc 
forcing P M £ M for which there exists a sequence 

M = M 0 c Mi c • • • C c • • • C M m = N 

with /r = = u >2 and the following properties: 

PI M^ + i = Aft [G^] for every £ < //, where G ^ is an generic filter in 
some forcing P^ £ M^. 

P2 For every Y £ M and S C Y with N |= |5| < 0 J 2 there exists an a < fi 
such that S £ M a . 
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P3 |(P M ) W | M < wf. 

P4 For every forcing (A, <C) £ N such that 

N \= (A is an ordinal number, A < Wi, and (A, <C) is ccc) 

and every a < /r there is a £ < /x, with £ > a, such that P ? = (A, <C). 

To argue that P1-P4 imply N |= MA+“c = tof' we also need the 
following definitions and lemma. For an uncountable cardinal number n 
and a partially ordered set P we will write MA K (P) to denote the statement 

MA k (P) For every family V of dense subsets of P such that \T>\ < k there 
exists a P-generic filter F in P. 

We will also use the symbol MA K for the statement that MA K (P) holds for 
every ccc forcing P. Thus MA is equivalent to MA C . 

Lemma 9.5.1 For an uncountable cardinal number k the following state- 
ments are equivalent: 

(i) MA k ; 

(ii) MA k (P) holds for every ccc forcing P = (P, <) with |P| < n; 

(iii) MA k (P) holds for every ccc forcing P = (A, <C), where X is a cardinal 
number less than n. 

Before we prove Lemma 9.5.1 let us see how it can be used to conclude 
N |= MA+“c = w 2 ” from P1-P4. 

First, we will prove that 


N |= MA W2 . (9.7) 

By Lemma 9.5.1 it is enough to show in N that MA W2 (P) holds for every 
ccc forcing P = (A, <C) with A being an ordinal number less than or equal 
to u>i . For this, let (A, <C) £ N be ccc in N with A < u>i , and let 

V = {Dc: £ < £ N be a family of dense subsets of (A, <C). Since 

P^ is ccc we have A < = uif 1 , and <C as well as x are 

subsets of utf 1 x £ M. Hence, by P2, there exists an a < /x such that 

( A , <),P £ M a . Moreover, by P4 there exists a < £ < /x with P ? = (A, <C). 
In particular, G ^ £ M %+ 1 is an generic filter in P^ = (A, <C). But 

V C M a C M%. So is also P-generic in (A, •C). Condition (9.7) has 
been proved. 

Now MA W2 and c = u >2 clearly imply MA. Thus, it is enough to argue 
that N |= c = o> 2 - But the inequality c N < u >2 follows from P3 and 
Theorem 9.4.8. The other inequality follows from the fact that 



198 


9 Forcing 


MA W2 => VG e [2“]- Ul 3/ £ 2“ \ G. (9.8) 

To see it, put P(w) = (Func aJ (w, 2), D) and for every g € 2“ define D g = 
{s G P(w): s (jL g}. Then the sets D g are dense in P(w). Let F be a 
{D g : g £ G}-generic filter in P(o>) . Then any extension of 1J F to a function 
/: u> — > 2 will have the desired property. The details are left as an exercise. 

Before we move to the technical aspects of the construction, it is worth- 
while to reflect for a moment on the idea behind the foregoing argument. 
It is clearly of the transfinite-induction nature and the argument is of a 
diagonal character. Condition PI represents an inductive step in which we 
take care of “one problem at a time.” Condition P2 represents a kind of 
closure argument. It tells us that the “small objects” from N can already 
be found in the earlier steps of our construction and thus we will have a 
chance to “take care of them” in the later part of the induction. 

The construction of the forcing P M leading to a model satisfying PI 
and P2 is of a very general nature and can be compared to the recursion 
theorem. The specific aspects of our model are addressed mainly by con- 
dition P4, with Lemma 9.5.1 allowing us to reduce the induction to one of 
reasonably short length. 

This construction closes, in a way, a full circle that we have made in this 
course. We started with simple recursion proofs. Next, when the difficulties 
with the length of induction mounted, we started to use refined recursive 
arguments by introducing additional axioms such as CH, <$•, and MA. This 
led us all the way to the forcing arguments of the previous sections, and 
finally to finish here by coming back to a relatively simple transfinite- 
induction argument. 

The preceding argument shows that in order to prove the consistency 
of MA+“c = u> 2 V it is enough to prove Lemma 9.5.1 and find a generic 
extension N of M satisfying P1-P4. We will start with Lemma 9.5.1. 


Proof of Lemma 9.5.1 Clearly (i)=>(iii) . Thus it is enough to show 
(iii)=>(ii) and (ii)=>(i). 

(ii)=>(i): Let P = (P, <) be a ccc partially ordered set and let V be 
a family of dense subsets of P such that \V\ < k. Choose a function 
/: P x P — ■> P such that f(p, q) extends p and q provided p and q are 
compatible in P, and for every D € V pick fo '■ P — > D such that fn {p) < p 
for every p £ P. Let po £ P and let Q be the smallest subset of P containing 
Po and being closed under the operations / and fn for every D £ T>. Thus 

Q = U Qn, 

n<u 
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where Q 0 = {Po} and Q n+1 = Q„ U f[Q n x Q n ] U \J D &V folQn] (compare 
with Lemma 6.1.6). Then \Q\ < \T>\ + u> < k. 

Note that any p,q £ Q that are compatible in P are also compatible 
in P 0 = (Q,<), since Q is closed under the action of /. Thus Po is ccc 
and, by (ii), MA K (P 0 ) holds. But, by the closure of Q under the action 
of fo, for every D £ V the set D Cl Q is dense in P 0 . So there exists a 
{D HQ: D £ T>}-generic filter F 0 in Po- To finish the proof it is enough to 
notice that F = {p € P : 3q £ Fq (q < p) } is a 2?-generic filter in P. The 
details are left as an exercise. 

(iii)=>(ii): The proof is very similar to that of Lemma 9.3.7. To see it, 
let P = (P, <) be a ccc partially ordered set such that A = |P| < n and let 
V be a family of dense subsets of P such that \V\ < k. Choose a bisection 
/ between A and P and define a partial-order relation <C on A by putting 

a</? f(a) < f{(3). 

Notice that / is an order isomorphism between Po = (A, <C) and P = 
(P, <). In particular, f~ 1 (D) is dense in Po for every D £ V. So Vq = 
{/ -1 (.D): DeD) is a family of dense subsets of Po and |P 0 | = |^| < «• 
Hence, by (iii), there exists a Po-generic filter Po in Po- To finish the proof 
it is enough to notice that /[Po] is a P-generic filter in P. The details are 
left as an exercise. □ 

To find a model N satisfying P1-P4 we first need to come back to 
the problem of expressing a model obtained by two consecutive generic 
extensions as a single generic extension. More precisely, if Mi = M[Go], 
where Go is an M-generic filter in Po £ M, and M 2 = M[Go][Gi], where 
Gi is an M[Go]-generic filter in Pi £ M[Gq\, we would like to find a P £ M 
and an M-generic filter G in P such that M 2 = M[G\. The product lemma 
(Theorem 9.4.2) gives a solution to this problem when Pi £ M. Thus, we 
will concentrate here on the case when Pi £ M[Go] \ M. To define such P 
we need the following definition. 

A pair (n, a) £ M of P-names satisfying the condition 

P ||— a is a partial-order relation on 7r (9.9) 

will be called a good F-name for a partially ordered set. (Note that, formally 
speaking, a good P-name is not a P-name.) If (n, a) is a good P-name for 
a partially ordered set we will often write 7 r for (ir, a). 

The following lemma tells us that any forcing Pi in M[Go] has a good 
Po-name representing it. So it explains the restriction of our attention to 
good names only. However, the lemma will not be used in what follows, so 
we will leave it without a proof. 
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Lemma 9.5.2 Let M be a countable transitive model of ZFC, P € M a 
partially ordered set, and Go an M-generic filter in P. If Pi = (P, <) € 
AI[Gq\ is a partially ordered set in M[Gf\ then there are P -names tt, a £ M 
satisfying (9.9) and such that (P,<) = (val Go (7r), val Go (cr)). 

Now let P be a partially ordered set and (tt, <„) be a good P-name for 
a partially ordered set. Then their iteration P * 7r is defined by 

P*7r = {(p, r):p€Pfcre dom(7r) & p ||— r £ tt}. 

Moreover, on P * tt we define a binary relation ■< by 

(p, t) ■< (q, <?) P< Q & p\\- t < n a. 

Notice that the relation ^ is reflexive and transitive on P * tt. It is 
reflexive since for any (p, r) £ P * tt condition p forces r £ tt and <„■ to be 
a partial-order relation on tt (since (tt, < n ) is good). The proof that ^ is 
transitive is left as an exercise. 

Unfortunately, ^ does not have to be antisymmetric on P*7r. To solve 
this problem, we notice that the relation ~ on P * n defined by 

(p, t) ~ (q, cr) 44> (p, t) < (q, a) & (q, a) ^ (p, r) 

is an equivalence relation on P*7r (see Exercise 4 from Section 2.4). Thus, 
in order to consider (P*7r, ■<) as a partially ordered set, we will identify the 
elements of P * 7r that are ~-equivalent . (Formally, we should replace P * tt 
by the quotient class P*7r/U and the relation ^ by the quotient relation 
< defined as in part (b) of Exercise 4 in Section 2.4. However, this would 
obscure the clarity of the “simple-identification” approach.) Upon such an 
identification we will consider P * n as a partially ordered set. 

Let us also notice that if the partially ordered sets Po and Pi are in the 
ground model M and tt and < T are the standard Po-names for Pi and its 
order relation then (tt, < n ) is a good name for a partially ordered set and 

Po*7r is isomorphic to Po x Pi. (9.10) 

Thus the process of iteration is truly a generalization of the product of 
forcings as described in the previous section. To formulate an iteration 
analog of the product lemma, we need one more definition. 

Let M be a countable transitive model of ZFC, Po £ M be a partially 
ordered set, and G 0 be an M-generic filter in P 0 . If tt is a good P 0 -name 
for a partially ordered set and Gi C valc 0 (tt) then we define 

G 0 *Gi = {(p, t) £ P 0 *tt: p£ G 0 & val Go (r) £ Gi}. 
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Theorem 9.5.3 Let M be a countable transitive model of ZFC and Po*7r 
be an iteration of a partially ordered set Po € M and a good Po -name 
7 t £ M for a partially ordered set. If G is an M -generic filter in Po *7r and 

G 0 = {p G P 0 : 3 (q, r) G G (q < p)j 
then Gq is an M -generic filter in Po- Moreover, if 
G\ = {val Go (r) : {q, r) G G} 

then G\ is an M[Go\-generic filter in Pi = val Go (7r), G = Go *Gi, and 

M[G) = M[Go][Gi). 

Proof It is easy to see that Go is a filter in Po. To show that Go is M- 
generic take a D 0 G M that is dense in P 0 . Then D={(p,r) G P 0 * 7 t: p G D 0 } 
is dense in Po*7r, since (p, r) ^ (q, r) and (p, r) G D for every (q, r) G Po*7r 
and p G D 0 with p < q. Take (p, r) G G (ID. Then p G Go n D 0 . Thus Go 
is an M-generic filter in Po- 

Next, we will show that Gi is a filter in Pi. So take val Go (r) G Gi with 
(q,r) G G witnessing it and let a G dom(7r) be such that val Go (cr) G Pi 
and val Go (r) < val Go (cr). We will show that val Go (cr) G Gi. For this, take 
Po G Go with po ||— r < cr and (p, p) G G such that p < po- Pick (r, 77) G G 
with (r, if) -< ( q,r } and (r, 77) ^ (p,p)- Then r ||— 77 < r and r ||— r < a, 
since r < po- So r ||— 77 < a, and (r, 77) ^ (r, cr). Hence (r, a) G G and 
val Go (cr) G Gi. 

To see that any two elements of Gi have a common extension in Gi 
take val Go (r) G Gi with (q,r) G G and val Go (cr) G Gi with (p, a) G G. 
Then there exists (r, 77) G G such that (r, 77) ^ (<7, r) and (r, 77) ^ (p, cr). So 
r G Go and r ||— (77 < r & 77 < cr). Thus val Go (?7) G Gi extends val Go (r) 
and val Go (cr). 

For the proof that Gi is M[Go]-generic in Pi take a D\ G M[Go] that is 
dense in Pi and let 6 be a P 0 -name such that D\ = val Go (<5). Pick p G Go 
that forces it, that is, 


p ||— 6 is a dense subset of 7 r, 


and define 


D = {(q,r} G P 0 *7r: q ||- r G 5}. (9.11) 

Now, if (q, rj) G G is such that q < p then it is easy to see that D is dense 
below ( q , 77) (the proof is left as an exercise). So there exists (r, r) G DflG, 
and val Go (r) G DiflGi. 
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To argue that G = Gq*G\ take (p, r) G G. Then p G Go and val< 3 0 (r) G 
G\ by the definitions of Go and G\ . So G C Go * Gi . To see the other 
inclusion, take (p, r) G Go * Gi C Po * n. Then p G Go and valc 0 (t) G Gi. 
So there exist (q, a) G G, with q < p, and (r, r) G G. Let ( s , rf) G G be a 
common extension of (g, <j) and (r, r). Then s < q < p and s ||— p < r. So 
(s, 77 ) (p, r) and (p, r) G G. 

Finally, Go, Gi G M[G], as they are defined using G. So M [Go] C M[G] 
by the minimality of M[Go], and M[Go][Gi] C M[G] by the minimality of 
M[G 0 ][Gi], Also', G = G 0 *Gi G Af[G 0 ][Gi]. So M[G] c Af[G 0 ][Gi]. ' □ 

Theorem 9.5.3 is an analog of the implication (i)=>(ii) from the product 
lemma. The iteration analog of the reverse implication, stated next, is also 
true. However, since we will not use this fact, we will leave its proof as an 
exercise. 

Theorem 9.5.4 Let M be a countable transitive model of ZFC, Go an M- 
generic filter in Po G M, and G 1 an M[Go\-generic filter in Pi G M[Gq\. 
If 7T G M is a good Po -name representing Pi then Go*Gi is an M -generic 
filter in P 0 *7r and M[G 0 ][Gi] = M[G 0 *Gi]. 

In the proof of the consistency of MA+->CH we will be interested only 
in the ccc forcings. The next lemma tells us that we will remain within 
this class if the iteration concerns two ccc forcings. 

Lemma 9.5.5 Let M be a countable transitive model of ZFC, Po G M be 
a ccc forcing in M, and ( 7 r, <„) be a good Po -name for a forcing such that 

Po ||— (ir, <n) is ccc. (9.12) 


Then Po * 7r is ccc in M. 

Proof To obtain a contradiction assume that there exists an uncountable 
antichain A = {(pj, r^) : £ < u>\} G M in P 0 * 7r. Let 

o’ = {(l,P£): £ < wi}. 

Then a G M is a Po-name. Define, in M, 

D = {p G P 0 : 3(3 < Wi (p ||- cr C /?)} 
and notice that D is dense in Po- 

To see this, let q G Po and let H be an M- generic filter containing q. 
Then 

S = valir(cr) = {£ < uji : p$ G H}. 

Notice that for distinct (,j;G 5 the conditions and val h { t v ) are 

incompatible in val^(7r). 
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Indeed, if £,77 £ S, £ ^ 77, and val//(r^) and val^r,,) are compatible in 
val#(7r) then there are r £ H extending p ^ and p v and p £ dom(7r) such 
that r ||— p 77 and r ||— p < x t v . But then (r, p) would be a common 
extension of (77,77) and (77,77), contradicting our assumption that A is 
an antichain. 

Thus S is countable in M[H\ since, by ( 9 . 12 ), val_y(7r) is ccc in M[H], 
So there exists a (3 < u>\ containing S. Now, if p £ H is such that p < q 
and p ||— S' C (3 then p £ D. So D is dense. 

Next consider the set 

T = {/3 < w x : 3 p 0 £ P 0 {p/3 ||- sup a = ( 3 )} 

and notice that A x = {pp: [3 £ T} is an antichain in Po- Thus A x and T 
are countable in M. Let 7 < u>i be such that T C 7. Then the set 

D' = {p £P 0 : p\\- a Cp} 
is dense in Po, since it contains D. 

Now, to obtain a contradiction, take an M-generic filter H in Po con- 
taining p 1 . Then, by the denseness of D', vain (cr) C 7. But, straight from 
the definition of <r, 7 £ val#(cr)! This contradiction finishes the proof. □ 

Now we turn our attention to the a-stage iteration for an arbitrary 
ordinal number a. More precisely, we will describe a general method of 
defining inductively a sequence P = ((P{,<{): £ < a) of forcings such 
that Pc+i can be identified with P^ * 717, where 7 7- is a good Pj-name for 
a partially ordered set. This, with help of Theorem 9 . 5 . 3 , will take care of 
the condition PI. The technical difficulty in this construction is that we 
have to know Pc in order to talk about a good P^-name 717. So, along with 
the sequence P, we have to construct a sequence II = ((7Tj,<^): f < a) 
of appropriate good P^-names. This obscures a simple idea that stands 
behind the construction. Thus we will first describe this construction with 
the sequence II replaced by a sequence ((Q^,<q ( ) : £ < a) of forcings. In 
this case for every (3 < a we define 

p / 3 =u| n^ : /e n^ & se[/ 3] <a 4 ( 9 .i 3 ) 

[)?eS J [ r)<0 J 

and order it by 

P<0 q O dom (q) C dom(p) & (V?7 € dom(g)) (71(77) < Qv q(r])) . 

In particular, 

Po = { 0 }, Pa = u P/3 for every limit ordinal 0 < A < a, 

0<\ 


( 9 . 14 ) 
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and 


P 7 C P /3 and < 7 =</3 fl(P 7 x P 7 ) for every 7 < (3 < a. (9.15) 

Notice also that 0 is a maximal element of every P /3 with (3 < a and that 
for every £ < (3 < a 

if r G P /3 extends p € Pj then r G P^ also extends p (9.16) 

and 

if q G P /3 and r G P^ extends q\$ then r U </|(/ 3 \{) G P /3 extends q. (9.17) 

Moreover, if P | +1 = P 5+1 \ P 5 = {p G P^+i : £ G dom(p)} for £ < a then 

(P£+i,<£+i) is isomorphic to (P^xQj,^) (9.18) 

via the isomorphism p 1 — > (p|^,p(£)). The easy proofs of (9.14)-(9.18) are 
left as an exercise. 

For the general case, when Pj-names 7 17 do not necessary represent the 
forcings from the ground model, the sequence ((Pc,<^): £ < a) will be 
defined by simultaneous induction with a sequence (( 717 , £ < a). In 

particular, we say that a sequence ((P^,<{): £ < a) of partially ordered 
sets is an a-stage forcing iteration (with finite support) if there exists a 
sequence (( 7 T^,< n( ): £ < a) such that every ( 7 T£,<„. s ) is a good P^-name 
for a partially ordered set, and if for every (3 < a 


**= u dom(7T r) ) : V 77 G S (p\ v ||- p{rj) G tt v ) > (9.19) 

se[/3]<- [ ves ) 

is ordered by 

P </3 q <=> dom(g) C dom(p) & (V 77 G dom (<?)) (p\ v ||- p(rj) q(r])). 

It is not difficult to see that the definition (9.19) gives the same notion of 
forcing as (9.13) if every 717 is the standard P^-name Q £ of Q 5 . Moreover, it 
is also easy to notice that the sequence defined by (9.19) has the properties 
(9.14)-(9.17) as well (the inductive proof of (9.17) is left as an exercise). 
Moreover, for P | +1 = {pG Pj+i : £ G dom(p)} a counterpart of (9.18) says 
that for every £ < a 


<P| + i, <^+i) is isomorphic to (P^ * 717 , ^), 


(9.20) 
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where an isomorphism is given by the map p i— > (p|{,p(£))- To see this, it 
is enough to notice that for every p,q £ P| +1 

P <t+i Q O dom(g) C dom(p) & (V 77 G dom(g) n () ( p\ v ||- p(rj) <*„ q{i 7 )) 

& p\e II- p(0 < 7T« q ( 0 
^ p\z <« q\t & p\s lb p(0 ?(0 

^ (pk,p{0) di (qk,q{0)- 

The next theorem is an a-stage-iteration version of Theorem 9.5.3. It 
implies the condition PI, where is defined as M[G^\ from the theorem. 

Theorem 9.5.6 Let M be a countable transitive model of ZFC, let a G M 
be an ordinal number, and let ((P^, <{) : £ < a) G M be an a-stage forcing 
iteration based on the sequence ((n^, <tt s ) ■ £, < a) of appropriate P ^-names. 
If G is an M-generic filter in P Q then = G fl Pj is an M -generic filter 
in Pj for every £ < a. Moreover, for every £ < a, if 

G 5 = {val G? (p(0): P G G €+ i & £ G dom(p)} 

then G ^ is an M[G^\-generic filter in a forcing P^ = (valc € (7T|), val G , (< T£ )) 
andM[G i+1 }=M[G t ][Gt}. 

Proof To see that Gj is a filter in P^ take p, q G Gj C G. Then there 
exists an r G G extending p and q. But r|c G P 5 belongs to G, since r < r||. 
So r|j G G^ and, by (9.16), r| ? extends p and q. Also, if q G Gj, p G P a , 
and q < p then p G Pj, and so p G Gj. Thus G{ is a filter in Pc. 

To prove that G^ is M-generic in P^ take a D G M that is a dense 
subset of P{ and notice that 

D* = {p G P a : p\ c G D} G M 

is dense in P a . Indeed, if q G P Q then q |c G Pj and there exists an r G D 
with r <£ q |j. But, by (9.17), p = r U <z|( a \j) G P a extends q and it is easy 
to see that p G D*. So D* is dense and there exists an r G G fl D*. Hence 
r|| G D fl G^. 

The proof of the additional part will be done in several steps. First 
notice that 


P| + i is dense in P^+i, (9-21) 

since every p G Pf+i \ P | +1 = Pc forces (n^, < 7r? ) to be a partially ordered 
set, so there exists a Pc-name r such that p ||— r G 7 rj and pU{ (£, r)} G P | +1 
extends p. 
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Next notice that 


G£ +1 = GnP| +1 = G l+1 \P ? 

is also an M-generic filter in P| +1 , since, by (9.21), every dense D G M 
subset of P| +1 is also dense in P^+i and D n G| +1 = D C l G^+i yf 0. Also, 
Gt+i = G^ U {p| £ : p G G| +1 } G M[G£ +1 ]. So M[G ?+1 ] = M[G£ +1 ], 

Now, if i is an isomorphism from (9.20) then 

*[G£ +1 ] = {(p||,p( 0 ) : P e GJ +1 } 

is an M-generic filter in P^ *717 and M[G|+i] = M[G| +1 ] = M[i[G| +1 ]]. 
So, by Theorem 9.5.3, i[G| +1 ] = G{ * G^, G^ is an M[G^]-generic filter in 
F* and M[G c+ i] = M[*[G| +1 ]] = M[G ? ][G«]. □ 

The next theorem says that an a-stage iteration (with finite support) 
of ccc forcings is ccc. This will guarantee that the forcing P^ from the 
beginning of the section will be ccc. The theorem is a generalization of 
Lemma 9.5.5. 

Theorem 9.5.7 Let M be a countable transitive model of ZFC, let a G M 
be an ordinal number, and let ((P{, <{) : £ < a) G M be an a-stage forcing 
iteration based on the sequence ((7 17, <„■*) : £ < a) of appropriate P ^-names. 

If 


P^ ||— 7 17 is ccc (9.22) 

for every £ < a, then P Q is ccc in M . 

Proof We will prove, by induction on /? < a, that Pp is ccc in M. The 
proof will be done in M . 

Clearly P 0 = {0} is ccc. So let 0 < (3 < a be such that P^ is ccc in M 
for every 77 < (3 and let A = {p 1 : 7 < 07} C P@. We will find 6 < 7 < u>\ 
such that ps and p 7 are compatible. 

If 0 is a successor ordinal, say j3 = f + 1, take C 0 = {7 < 07 : p 7 G P^}. 
If | G 0 1 = u)± then A 0 = {p 7 : 7 G Co} C Pt, and Pc is ccc by the inductive 
assumption. Hence Aq C A contains two compatible elements with different 
indices. So assume that | Go | <07. Then C\ — uj\ \ Go is uncountable and 
Ai = {p 7 : 7 G Gi} C P| +1 . But P| + i is isomorphic to P^ *717, which, by 
Lemma 9.5.5, is ccc. Thus, once more, A\ C A contains two compatible 
elements with different indices. 

If j3 is a limit ordinal, apply to the family (dom(p 7 ) : 7 < 07} the A- 
system lemma to find an uncountable set G C u>\ and a finite set D C (3 
such that dom(p 7 ) fl clom(p5) = D for all distinct 7 , 6 G G. Let f < [3 be 
such that D C f and consider {p 7 |{: 7 G G} C P{. Since, by the inductive 
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assumption, Pc is ccc, we can find p £ Pc and distinct 7 ,6 £ C such that 
p Pj\i and P <£ Ps If- To finish the proof it is enough to notice that 

q = pUPj\(0\£) Up«| ( ^) (9.23) 

belongs to P^ and extends p 7 and ps; this follows from (9.17). The details 
are left as an exercise. □ 

We say that an a-stage forcing iteration ((P{,<{): £ < a) is a ccc 
forcing iteration if it satisfies condition (9.22). 

The next lemma is the last general fact concerning the a-stage iteration 
and it will imply property P2. It is an analog of Lemma 9.4.3. 

Lemma 9.5.8 Let M be a countable transitive model of ZFC, let n £ M 
be an infinite regular cardinal number, and let ((Pj,<j): £ < n) £ M be a 
K-stage ccc forcing iteration. If Y £ M, G is an M-generic filter in P K , 
and S C Y is such that M[G] |= IS) < k then there exists a ( < n such 
that S £ M[G(], where G^ = G (~l Pf . 

Proof By Theorem 9.5.7 the forcing P K is ccc. So the cardinals in M and 
M[G\ are the same, and we will not distinguish between them. 

Let X = A, where A is the cardinality of S in M[G). Then X £ M and 
|X | = A < K. Let / £ M[G ] be a map from X onto S. We will find a(<K 
such that / £ M[G^]. 

By Lemma 9.2.8 we can find a P K -name r £ M such that / = val G (r), 
t C {((x, y),p): x £ X & y £Y fcpg P K }, and 

(*) A x = jp £ P K : 3y £ Y ^((x, y),p) £ tJ | is an antichain for every 
x £ X. 

So every A x is countable in M, since P K is ccc in M. Let 

A= u U dom(p) . 

xex peA x 

Then A £ M, A C k, and |Al| < \X\ + u> < k in M, since every set 
dom(p) is countable in M. Thus, by the regularity of k, there exists 
a(<K such that 4c(. In particular, (J x A x C P G So 

T * = { 2/) iP)' ((x,y),p) € r| £ M 

is a P^-name, where (x, y) is a standard P^— name for (x, y) £ M, and 

/ = val G (r) = {(x,y): 3p£ G (((x,y),p) £ r)J 

= {(a:,y>: (3p £ G n P f ) ({(x,y) ,p}eT*)j 
= val G< (r) £ M[G(f\. 
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So S= f[X] G M[G C }. □ 

Now we are ready for the main theorem of this section. 

Theorem 9.5.9 It is consistent with ZFC that c = W2 and MA holds. 

Proof By the foregoing discussion, it is enough to construct a fj ,- stage 
ccc finite-support forcing iteration ((P{,<j): £ < n), with /j = , that 

satisfies P3 and P4. The construction will be done in a countable transitive 
model M of ZFC in which 2 Wl = u >2 (such an M exists by Corollary 9.3.9). 
In what follows we will use the notation introduced in the first part of this 
section. We will also use repeatedly the fact, following from (9.15), that 
for every £ < £ < /i any P^-name is a P^-name too. 

Let ((o;j,/3{): ( < /i) € M be a one-to-one enumeration of g x /i 
such that /?£ < £ for every £ < //. By induction on ( < (i we will con- 
struct a finite-support iteration ((Pj, <$) : £ < /z), together with a sequence 
((717 , : £ < m) °f appropriate P^-names, and a sequence (S^: £ < /x) 

such that the following inductive conditions are satisfied for every £ < /. i . 

(I«) ! p d < wi- 

(lie) ^ = ((A| , <t|) : a < /I) contains every pair (A, <r) with A < w\ and 

a C {((7 ,S),p): 7,«5 G A & p G P 4 }. 

(ct will be used as a P^-name for a partial order on A.) 

(IIIj) 717 = A is a standard Pj-name for A = Xa\ and 

<tt £ = {(<7 ,8),p) eaa\--3qe A(q<tp)} 

U{((7 7<<5<A&pG.B}, 


where 

A = {<7 G P^ : <7 ||— (A, <Ja\) is a ccc forcing} 

and 13 = {p G P^ : p is incompatible with every q G A} . In partic- 
ular, 


Pc ||— (7 17, < w? ) is a ccc forcing. (9.24) 

To see that this inductive construction preserves (1^) consider two cases. 
If 0 < £ < fi is a limit ordinal, then P^ = (J ^ p ») has cardinality < uq 
by the inductive assumption. If £ = 77 + 1 is a successor ordinal, then 
Pj = P^ U P|, and P| is isomorphic to P^ * 7r v . But by (I,,) and (III,,) we 
have 1^1 < u> 1. So |Pc| = |P^| ® (|P^| ® |7r^|) < u\ as well. 
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To see that the sequence S £ in (11^) can be found note that there are only 
to 2 pairs (A, a) to list since, by (Ij), the set {((7, S),p) : 7, 6 G A & p € Pc} 
has cardinality < u>i and so it has at most u> 2 subsets, as 2 Wl = u 2 in M. 

To finish the construction it is enough to argue for (9.24). For this, 
notice first that 


A U B is dense in 


(9.25) 


and 


P ||- (crf^ =<x 5 ) for every p £ A. (9.26) 


To see (9.26) take po £ A and an M -generic filter H in P^ containing 
Po . Then 


val = 



((7 ,6),p) 

G 

& 

P G H} 



{(7, 6): 

((7 ,6),p) 

G 

& 

3q£A(q < ? 

P) 

& p G H} 

{(7, <$}: 

((7 ,S),p) 

G<7rj 

& 

3q G A (q < ( 

P) 

fcpG H} 

{(7, <5): 

«7,<5>,p> 

G<tt € 

& 

PG H} 



val ff (< 

)) 







proving (9.26). 

To argue for (9.24) let H be an M-generic filter in P^. We have to show 
that (valff(7Tj), valH(< 7 r $ )) = (A, val_f/(<^ 5 )) is a ccc partially ordered set 
in M[H ]. But, by (9.25), either A n H yf 0, in which case this is true since 
an element of A n H forces (7T^,< W? ) = (A, crf|) to be a ccc forcing, or 
B fl H ^ 0, in which case val_y(< W£ ) is just a standard well ordering of A, 
so it is obviously ccc. This finishes the inductive construction. 

Now, to see that P3 is satisfied note that P p = U^< M Pf has cardinality 
< W2. Thus |(P M )“| < (w 2 ) Wl = 2 W1 = UJ 2 . ' 

To see that P4 holds let G be an M- -generic filter in P^, let N = M[G], 
and let (A, <C) G N be such that 

N 1= A < and (A, <C) is ccc. 

Choose an a 0 < P- By Theorem 9.5.6 it is enough to find ag < £ < p such 
that P^ = (val G ,(7T|), val G , (<^7} = (A, <C). But, by Lemma 9.5.8, there is 
«o < C < 9 such that (A, <C) € M[G^]. Take a P^-name a* € M such that 
val Gf (cr*) is equal to <C and let 


a = {(( 7 ,6},p}-. 7,<5 G A & p £ P f & p ||- (7,6) G a*} G M. 


Note that 


val Gc (cr) = val Gc (a*) 
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since for every (7, 8) G A x A 

(7, 8) G val Gf (a) 3p G Gy (p ||— (7, 8) € a*) o (j,8) G val G< (cr*). 

Next, choose a < p such that (A £,cr£) = (A ,a) and pick £ < /i with 
(«{ , /%) = (a, C)- Then a 0 < C = /% < £ and A„| = A£ = A, so val G( , (717) = 
A. Also, <rf^ = <r£ = <7 and 

val G? (cr) = val Gc (cr) = val Gc (ct*) = (<) G M[G C \ C M[G^ 

since = G^ ft P G 

Thus it suffices to argue that 

val G£ (< W5 ) = val G{ (cr). (9.27) 

But (A, <) is a partially ordered set in M[G^\, since it is a partially ordered 
set in N and this is an absolute property. Also, (A, •C) is ccc in M[G^\, 
because every antichain A G M[G^\ is also an antichain in N and 

w= |4| w = \A\ M = \A\ M ™, 

since (A, <C) is ccc in N and both extensions of M preserve the cardinals. 
Thus there is a p G Gj that forces this to be true for (A, a) = (A, a a\)- In 
particular, Gj fl A ^ 0 so (9.26) and trfj = a imply (9.27). This finishes 
the proof. □ 


EXERCISES 

1 Complete the details of the proof of (9.8). 

2 Complete the details of the proof of Lemma 9.5.1. 

3 Prove that the relation A defined on an iteration P*7r is transitive. 

4 Prove (9.10). 

5 Complete the proof of Theorem 9.5.3 by showing that the set D from 
(9.11) is indeed dense below (<7,77). 

6 Prove Theorem 9.5.4. 

7 Prove properties (9.14)-(9.18). 

8 Prove (9.17) for the forcing defined by (9.19). 

9 Complete the proof of Theorem 9.5.7 by showing that the condition q 
defined by (9.23) has the desired properties. 
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Axioms of set theory 


Axiom 0 (Set Existence) There exists a set: 

3x(x = x). 

Axiom 1 (Extensionality) If x and y have the same elements, then x is 
equal to y: 

\/x\/y \iz(z £ x<-^z £ y)—>x = y\. 

Axiom 2 (Comprehension scheme or schema of separation) For every for- 
mula <p(s, t) with free variables s and t, for every x, and for every 
parameter p there exists a set y = {u £ x: ip(u,p)} that contains all 
those u £ x that have property ip: 

\/x\/p3y [Vu(« £ y<-^(u £ x & ip(u,p)))]. 


Axiom 3 (Pairing) For any a and b there exists a set x that contains a 
and b: 

\/afb3x(a £ x & b £ x). 

Axiom 4 (Union) For every family T there exists a set U containing the 
union [J T of all elements of T : 

T3UNYM x[(x £Y kY £ T)->x £ If}. 


Axiom 5 (Power set) For every set X there exists a set P containing the 
set V(X) (the power set) of all subsets of X : 

\/X3P\/z [z C £ P]. 
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To make the statement of the next axiom more readable we introduce 
the following abbreviation. We say that y is a successor of x and write 
y = S(x) if S(x) = x U {x}, that is, 

V z[z G y <-> (z € x V z = a;)]. 

Axiom 6 (Infinity) (Zermelo 1908) There exists an infinite set (of some 
special form): 

3a; \iz(z = 0— G x) & \/y G x\z{z = S(y)—>z G a;)]. 

Axiom 7 (Replacement scheme) (Fraenkel 1922; Skolem 1922) For every 
formula <p(s, t, U, w) with free variables s, t, U, and w, every set A , 
and every parameter p if ip(s,t, A,p) defines a function F on A by 
F( x) = y ip(x,y, A,p), then there exists a set Y containing the 
range F[A\ = {F(x) : x G A} of the function F: 

VAVp[Vx G A3\yip(x,y, A,p)^>3YMx G A3y G Yip(x,y,A,p)}, 

where the quantifier 3!a;^(a;) is an abbreviation for “there exists pre- 
cisely one x satisfying ip,” that is, is equivalent to the formula 

3xip{x) & \/x\/y(p(x) & ip(y)~^x = y). 

Axiom 8 (Foundation or regularity) (Skolem 1922; von Neumann 1925) 
Every nonempty set has an G-minimal element: 

\/x [3y(y G x)— >3 y(y G x & ->3z(z G x & z G y))]. 

Axiom 9 (Choice) (Levi 1902; Zermelo 1904) For every family T of dis- 
joint nonempty sets there exists a “selector,” that is, a set S that 
intersects every x G T in precisely one point: 

VF [Vx G F{x yf 0) & Vx G TVy G T{x = y V x n y = 0)] 

-^■3SVx G F3\z[z G S & z G x), 
where x fl y = 0 is an abbreviation for 

->3 z(z G x & z G y). 

Using the comprehensive schema for Axioms 0, 3, 4, and 5 we may 
easily obtain the following strengthening of them, which is often used as 
the original axioms. 
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Axiom O' (Empty set) There exists the empty set 0: 

3x\/y^(y £ x). 

Axiom 3' (Pairing) For any a and b there exists a set x that contains 
precisely a and b. 

Axiom 4' (Union) (Cantor 1899; Zermelo 1908) For every family T there 
exists a set U = (J T , the union of all subsets of T. 

Axiom 5' (Power set) (Zermelo 1908) For every set x there exists a set 
Y = V(x), the power set of x, that is, the set of all subsets of x. 

The system of Axioms 0-8 is usually called Zermelo-Fraenkel set theory 
and is abbreviated by ZF. The system of Axioms 0-9 is usually denoted by 
ZFC. Thus, ZFC is the same as ZF+AC, where AC stands for the axiom 
of choice. 

Historical Remark (Levy 1979) The first similar system of axioms was 
introduced by Zermelo. However, he did not have the axioms of founda- 
tion and replacement. Informal versions of the axiom of replacement were 
suggested by Cantor (1899) and Mirimanoff (1917). Formal versions were 
introduced by Fraenkel (1922) and Skolem (1922). The axiom of foundation 
was added by Skolem (1922) and von Neumann (1925). 

Notice that Axioms 2 and 7 are in fact the schemas for infinitely many 
axioms, one for each formula ip. Thus theory ZFC has, in fact, infinitely 
many axioms. Axiom 1 of extensionality is the most fundamental one. 
Axioms 0 (or O') of set existence (of empty set) and 6 of infinity are “ex- 
istence” axioms that postulate the existence of some sets. It is obvious 
that Axiom 0 follows from Axiom 6. Axioms 2 of comprehension and 7 
of replacement are schemas for infinitely many axioms. Axioms 3 (or 3') 
of pairing, 4 (or 4') of union, and 5 (or 5') of power set are conditional 
existence axioms. The existing sets postulated by 3', 4', and 5' are unique. 
It is not difficult to see that Axiom 3 of pairing follows from the others. 

The axiom of choice AC and the axiom of foundation also have the 
same conditional existence character. However, the sets existing by them 
do not have to be unique. Moreover, the axiom of foundation has a very 
set-theoretic meaning and is seldom used outside abstract set theory or 
logic. It lets us build “hierarchical models” of set theory. During this 
course we very seldom make use of it. The axiom of choice, on the other 
hand, is one of the most important tools in this course. It is true that 
its nonconstructive character caused, in the past, some mathematicians to 
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reject it (for example, Borel and Lebesgue). However, this discussion has 
been for the most part resolved today in favor of accepting this axiom. 

It follows from the foregoing discussion that we can remove Axioms 0 
and 3 from 0-8 and still have the same theory ZF. 



Appendix B 

Comments on the forcing 
method 


This appendix contains some comments and explanations for Section 9.1. 
It also includes some missing proofs of the results stated there. 

We start by explaining a few more terms. In Section 9.1 we said that a 
formula ip is a consequence of a theory T, T b ip, if there is a formal proof 
of ip from T. By a formal proof in this statement we mean a formalization 
of what we really do in proving theorems. Thus a proof of ip from T is a 
finite sequence ipo, . . . ,ip n of formulas such that ip n = p and a formula ipk 
can appear in the sequence only because of one of following two reasons: 

• ipk is an axiom, that is, it belongs to T or is a logic axiom; or 

• ipk is obtained from some ipi and ipj ( i,j < k) by a rule of detachment 
(also called a modus ponens rule), that is, if there exist i,j<k such that 
Pj has the form ipi—^ipk- 

Since any formal proof is a finite sequence of formulas, it can contain only 
finitely many sentences from T. Thus, if T b ip then there is a finite sub- 
theory To C T such that To b ip. In particular, if theory T is inconsistent, 
then there is a finite subtheory To of T that is also inconsistent. So theory 
T is consistent if and only if every one of its finite subtheories is consistent. 

Note also that if ip$, . . . , ip n is a proof oi tp — <p n in theory TU{i/>} then 
fif-^ipo ), . . . , (ip—Mp n ) is a proof of ip-^np in T, where formula if-+((pi— j npk) 
is identified with ipi—^ipip-^npk). Thus, if ip is a consequence of a finite 

theory T 0 = {tp 0 , . . . ,ip n } then 0 b (ip 0 ->( ■ ■ ■ )), that is, 

is a consequence of the axioms of logic. 

To argue for the forcing principle we will need the following theorem. 
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Theorem B.l Let S and T be two theories and assume that for every fi- 
nite subtheory Sq of S we can prove in theory T that there exists a nonempty 
transitive set M that is a model for Sq. Then Con(T) implies Con(S'). 

Proof Let S and T be as in the theorem and assume that S is inconsistent. 
It is enough to show that this implies the inconsistency of T . 

Since S is inconsistent, there is a finite subtheory So = {ifi, . . . ,if n } 
of S such that So is inconsistent, that is, such that So b ph^p for some 
sentence ip. In particular, (ifi&z • ■ ■ Szif n )— >(p&z^p). 

But, by our assumption, in theory T we can prove the existence of a 
nonempty transitive set M that is a model for Sq, that is, such that if M 
holds for every if from So- In particular, 

T b 3M (if^k ■ ■ ■ kifff ). 


Now, it is enough to notice that the rules of formal deduction are set up 
in such a way that if ipo, ■ ■ ■ ,p n is a formal proof of < pSz~>p from Sq then 
pff , . . . , ipn is a formal proof of p M k,~<p M from Sq 1 = {if f 1 , . . . , if In 
particular, the implication 


is true for every M . So we have proved in T that there is an M such that 
i tff I &---&ifff, while (iff 4 & ■ ■ ■ $zifff)—>(<p M &^<p M ) is true for every M. 
So 3 M (ip M &z-np M ) is a consequence of T, that is, T is inconsistent. □ 

Now assume that we have proved the following condition (F) from the 
forcing principle: 

(F) Every CTM M of ZFC can be extended to a CTM N of ZFC +“if.” 
If we could prove in ZFC that 


3 M (. M is a CTM for ZFC) 


then we would conclude in ZFC that there exists a model N of ZFC +“if,” 
and this, by Theorem B.l, implies that Con(ZFC)=> Con(ZFC +“if”). Un- 
fortunately, the existence of a countable transitive model of ZFC cannot 
be proved from ZFC axioms (this follows from Theorem 1.1.1). Thus we 
need the following more refined argument. 

A closer look at the forcing method shows that it lets us extend any 
countable transitive set M to another countable transitive set N = M[G]. 
Then Theorem 9.2.2 asserts that if M satisfies ZFC then so does M[G\. 
However, an examination of this proof (which is not included in this text) 
shows that checking whether M[G\ has a given property ip is of finitistic 
character in the sense that in the proof of the implication 


if M is a model for ZFC then M[G\ is a model for “ip" (B.l) 
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we use the knowledge that M is a model for a given axiom if only for 
finitely many axioms if of ZFC. In particular, for any sentence <p for which 
(B.l) holds there is a finite subtheory T v of ZFC such that the implication 

if M is a model for T v then M[G\ is a model for “ip” (B.2) 

has the same proof as (B.l). To show that such a nice proof of (F) implies 
Con(ZFC)=t- Con(ZFC +“if”) we need one more theorem, which will be left 
without proof. 

Theorem B.2 For every finite subtheory T of ZFC it is provable in ZFC 
that there exists a countable transitive model M ofT. 

Now, to argue for the forcing principle assume, to obtain a contradic- 
tion, that (F) holds and ZFC+“^” is inconsistent. Then there exists a 
finite subtheory S of ZFG+“if” that is also inconsistent. For every ip from 
S let T v be a finite subset of ZFC for which (B.2) holds and let T be the 
union of all T v for <p from S. By Theorem B.2 there is a CTM M for T, 
and using (B.2) we conclude that there exists a countable transitive model 
N = M[G\ for S. Since all of this was proved in ZFC, we can use Theo- 
rem B.l to deduce that Con(ZFC) implies Con(S'). But S is inconsistent, 
so it can only happen if ZFC is inconsistent. This finishes the argument 
for the forcing principle. 

We will finish this appendix with the list of Ao-formulas that constitute 
the 

Proof of Lemma 9.2.5 

(0) Formula ifo(x,y) representing x C y: 

\/w £ x(w £ y). 

(1) Formula ifi(x,y) representing y = (Jx: 

\/w £ y3z £ x{w £ z) & Vz £ x\/w £ z(w £ y). 

(2) Formula if 2 (x,y) representing y = p|ai: 

\/w £ yiz £ x(w £ z) & Vz £ x\/u> £ z\iu £ x(w £ u)—>(w £ y)\. 

(3) Formula ifs(x, y, z) representing z = x U y: 

\/w £ z(w £ x V w £ y) & Vu> £ x{w £ z) Sz \/w £ y(w £ z). 
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(4) Formula ipi(x, y, z) representing z = x D y: 

Vie £ z(w £ x & w £ y) & Vie £ x[w £ y^w £ z]. 

(5) Formula ip 5 (x, y, z) representing z = x\y: 

Vie £ z(w £ x & w £ y) & Vie £ x{w (jL y^w £ z). 

(6) Formula ip 6 (x, y, z) representing 2 = {x, y}: 

x£zSzy£zSz\/t£z(t = x V t = y). 

(7) Formula 1^7 (z) representing “z is an unordered pair”: 

3x £ z3y £ z ip 6 (x,y,z). 

(8) Formula il>$(x,y,z) representing z = (x,y) = {{a;}, {x, j/} } : 

3u £ z3w £ z[il>e(x, x, u) & ip&(x , y, w ) & i>e{ u > ie, z)]. 

(9) Formula i/>g(z) representing “z is an ordered pair”: 

3ie £ z3 x £ w3y £ wipg(x, y, z). 

(10) Formula ipw{x, y, z) representing z = x x y: 

Vie £ z3s £ x3t £ y ^(s, t, w) & Vs £ x\/t £ i/3ie £ z t , ie). 

(11) Formula i/>n(r) representing “r is a binary relation”: 

Vie £ r ^)g(ie). 

(12) Formula i/> 12 (d, r) representing “d is the domain of a binary rela- 
tion r”: 

Va: £ eEb £ r3ie £ z3y £ ie ^(^j V, z) 

Sz\/z £ rVie £ z\/ x £ ieVy £ ie[i/>8(a:, j/, 2 )— >a: £ d]. 

(13) Formula ipi 3 (R, r) representing “i? is the range of a binary relation r”: 

Vy £ R3z £ r3ie £ z3x £ ie ipg(x, y, z) 

& Vz £ rVie £ zVa; £ leVy £ w[i/jg (x,y, z)^>y £ /?]. 
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(14) Formula 0i4(/) representing “/ is a function”: 

0n(/) & Vp G fiq G /Vu G pVio G gVa G u\!b G uVd G tu 
[[V> 8 (a,6,p) & 08(M,d)H& = d]. 

(15) Formula V’is (/) representing “function / is injective”: 

ipuif) & Vp G /Vg G /Vw € pVu; € gVa G wV6 G wVc G wA/d G w 
[[(if s (a,b,p) & ip s (c,d,q) & b = d)]— >a = c]. 

(16) Formula 0i6(<,P) representing “< is a partial-order relation on P”: 

0n(<) & V , i2(Pi<) & 0i 3 (P,<) & Va; € P3u> G< ips{x,x,w) 

& V a; G PVp G PVu G< Vu; G< [(0s(a;, p, v) & ^(j/i x, w)) — > a: = p] 
& Va: G PVp G PVz G PVu G< Vu; G< 

[(0s(a:,p,u) & ip 8 (y,z,w)) -> 3u G< 0s (a;, 2,u)]. 

(17) Formula tpn(D, <,P) representing “I? is a dense subset of the par- 
tially ordered set (P, <)”: 

i/Die (< , IP) & tpo(D, P) & Va; G P3d G Z?3u; G< ip 8 {d,x,w). 

(18) Formula ipi 8 (A, <, P) representing “A is an antichain in the partially 
ordered set (P, <)”: 

V’ie(<, P) & 0 O (A P) & Va G AV6 G A'ix G PVu g< Vu; g< 

[(08 (a:, a, u) & 0s(x, b, u)) — > a = 6]. 

(19) Formula 0ig(a:) representing “set a: is transitive”: 

Vp G a:Vz G 2/(2: G x). 

(20) Formula 02o(a) representing “a is an ordinal number”: 

019 (a) & V/3 G aVy G a(/3 = 7 V /3 G 7 V 7 G /3). 

(21) Formula 02i(a) representing “a is a limit ordinal number”: 

02o (a) & V/3 G a37 G a(/3 G 7). 

(22) Formula 022(a) representing “a = u”: 

1 P 21 (a) & V/3 G a[02i(/3) — > Va; G /3(a: ± a;)]. 

(23) Formula ip 23(a) representing “a is a finite ordinal number”: 

02 o(a) & — >-022 (a) & V/3 G a(->022(/3)). 

(24) Formula 023(a) representing “a is a successor ordinal number”: 

■020(a) & -•021 (a)- 


□ 
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• x € y - x is an element of y, 6. 

• —up - the negation of formula <p, 6. 

• ip&zip - the conjunction of formulas ip and ip, 6. 

• ip V ip - the disjunction of formulas ip and ip, 6. 

• ip^ip - the implication, 6. 

• ip-^ip - the equivalence of formulas ip and ip, 6. 

• 3xip - the existential quantifier, 6. 

• \/xip - the universal quantifier, 6. 

• 3x € Aip - a bounded existential quantifier, 6. 

• \/x € Aip - a bounded universal quantifier, 6. 

• x C y - x is a subset of y, 6. 

• 0 - the empty set, 7. 

• IJJ 7 the union of a family T of sets, 8. 

• V{X) - the power set of a set X, 8. 

• x U y - the union of sets x and y, 8. 

• x\y - the difference of sets x and y, 8. 

• J T - the intersection of a family T of sets, 8. 

• x n y - the intersection of sets x and y, 9. 
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• xAy - the symmetric difference of sets x and y, 9. 

• (a, b) - the ordered pair {{a}, {a, &}}, 9. 

• (ffli, a 2 , . . . , a n -i, On) - the ordered n-tuple, 10. 

• X x Y - the Cartesian product of sets X and Y, 10. 

• S(x) - the successor of x: x U {a;}, 10. 

• clom(l?) - the domain of a relation (or function) R , 12. 

• range(l?) - the range of a relation (or function) R , 12. 

• 1 - the inverse of a relation (or function) R, 13. 

• S o R - the composition of the relations (or functions) R and S, 13. 

• /: X — > Y - a function from a set X into a set Y, 16. 

• Y x - the class of all function from a set X into a set Y, 16. 

• f[A\ - the image of a set A with respect to a function /, 16. 

• f~ 1 (B) - the preimage of a set B with respect to a function /, 16. 

• f\A ~ the restriction of a function / to a set A, 18. 

• UteT Ft ~ the union of an indexed family {F t }teT, 19. 

• Piter Ft - the intersection of an indexed family {F t } te r, 19. 

• Titer Ft ~ the Cartesian product of an indexed family {F t } te T, 20. 

• N - the set of natural numbers, 26. 

• lo - the set of natural numbers, 27; 

the order type of an infinite strictly increasing sequence, 39; 
the first infinite ordinal number, 44. 

• Z - the set of integers, 30. 

• Q - the set of rational numbers, 30. 

• R - the set of real numbers, 31. 

• B(p , £■) - the open ball in R” with center p and radius e, 32. 

• int (.S') - the interior of a set S in R", 33. 

• cl(S) - the closure of a set S in R”, 33. 

• u>* - the order type of an infinite strictly decreasing sequence, 39. 
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• 0(x o) - the initial segment {x € X : x < cco}, 40. 

• Otp(W) - the order type of a well-ordered set W, 47. 

• |A| - the cardinality of a set A, 61. 

• 2 K - the cardinality of the set 'P(k), 65. 

• Xa ~ the characteristic function of a set A, 65. 

• k + - the cardinal successor of a cardinal k, 65. 

• ui a - the ath infinite cardinal number, 66. 

• - aleph-alpha: the ath infinite cardinal number, 66. 

• D q bet-alpha, 66. 

• c - continuum: the cardinality of the set V(u > ), 66. 

• k © A - the cardinal sum of cardinals n and A, 68. 

• k (g> A the cardinal product of cardinals k and A, 68. 

• - cardinal exponentiation: the cardinality of the set 68. 

• A <UJ - the set of all finite sequences with values in A: Un^A", 71. 

• [X]- K - the family of all subsets of X of cardinality < k, 72. 

• [A'] <K - the family of all subsets of X of cardinality < n, 72. 

• [A'] K - the family of all subsets of A of cardinality k, 72. 

• C(R) - the family of all continuous functions /: R — > R, 73. 

• cf(a) - the cofinality of an ordinal number a, 74. 

• cl p{Z) - closure of a set Z under the action of T , 86. 

• F' - limit points of F C M", 92. 

• (j[7F\ - the smallest a-algebra containing T \ 94. 

• Bor - the cr-algebra of Borel sets, 94. 

• - subsets of Bor , 94. 

• 11° - subsets of Bor, 94. 

• A fT> the ideal of nowhere-dense sets, 98. 

• M - the cr-ideal of meager (first-category) sets, 98. 

• N - the cr-ideal of Lebesgue measure-zero (null) subsets of R", 99. 
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• LINq(_B) - linear subspace of R over Q generated by B C R, 111. 

• Func(X, Y) - all partial functions from X into Y, 131. 

• Func K (X,y) - all partial functions from X into Y with the domain of 
cardinality < k, 131. 

• <* - a relation on w", 135. 

• <* - a linear-order relation on uf 1 , 139. 

• =* - an equivalence relation on u) u , 139. 

• P = P 0 x Pi - the product of partially ordered sets Po and Pi, 149, 190. 

• trcl(a’) - the transitive closure of x, 165. 

• R(a) - 165. 

• rank(;r) - the rank of x, 166. 

• T b ip - sentence ip is a consequence of a theory T, 167. 

• Con(T) - theory T is consistent, 167. 

• ip M - the relativization of a formula ip to a set M, 168. 

• M \= ip - a formula ip is true in M, 168. 

• M[G\ - a generic extension of model M, 170, 175. 

• - uji in model M, 175. 

• c M c in model M, 175. 

• valc(r) - valuation of a P-name r, 175. 

• r - standard P-name for a generic filter in P, 176. 

• P ||— p,m P ~ P forces ip, 176. 

• || forcing relation, 176. 

• P ||— ip - every p £ P forces <p, 177. 

• 7T ||— p <p(x i, . . . , x n ) - formula for “p forces ip(x ±, . . . , x n ),” 177. 

• ||-* - 177. 

• P (Z) - Cohen forcing on a set Z, 192. 

• P * 7r - iteration of P and a good P-name n, 200. 

• G 0 *Gi - 200. 




References 


Cantor, G. 1899. Correspondence between Cantor and Dedekind (in Ger- 
man). In Cantor (1932: 443-51). 

Cantor, G. 1932. Gesammelte Abhandlungen: Mathematischen und philoso- 
phischen Inhalts. Berlin: Springer. 

Ciesielski, K. 1995-6. Uniformly antisymmetric functions and K 5 . Real 
Anal. Exchange 21 : 147-53. 

Ciesielski, K. 1996. Sum and difference free partitions of vector spaces. 
Colloq. Math. 71 : 263-71. 

Ciesielski, K., and Larson, L. 1993-4. Uniformly antisymmetric functions. 
Real Anal. Exchange 19 : 226-35. 

Ciesielski, K., and Miller, A. W. 1994-5. Cardinal invariants concerning 
functions whose sum is almost continuous. Real Anal. Exchange 20 : 
657-72. 

Erdos, P. 1969. Problems and results in chromatic graph theory. In Proof 
techniques in graph theory, ed. F. Harary, 27-35. New York: Aca- 
demic Press. 

Erdos, P. 1978-9. Measure theoretic, combinatorial and number theoretic 
problems concerning point sets in Euclidean space. Real Anal. Ex- 
change 4 : 113-38. 

Erdos, P., and Kakutani, S. 1943. On non-denumerable graphs. Bidl Amer. 
Math. Soc. 49 : 457-61. 

Fraenkel, A. A. 1922. Zu den Grundlagen der Cantor-Zermeloschen Men- 
genlehre. Math. Ann. 86 : 230-7. 

Frege, G. [1893] 1962. Grundgesetze der Arithmetik. Vol 1. Reprint. 
Hildesheim: Olms. 


225 



226 


References 


Freiling, C. 1986. Axioms of symmetry: throwing darts at the real number 
line. J. Symb. Logic 51: 190-200. 

Freiling, C. 1989-90. A converse to a theorem of Sierpinski on almost 
symmetric sets. Real Anal. Exchange 15: 760-7. 

Friedman, H. 1980. A consistent Fubini-Tonelli theorem for non-measurable 
functions. Illinois J. Math. 24: 390-5. 

Galvin, F. 1980. Chain conditions and products. Fund. Math. 108: 33-48. 

Kirchheim, B., and Natkaniec, T. 1990-1. On universally bad Darboux 
functions. Real Anal. Exchange 16: 481-6. 

Komjath, P., and Shelah, S. 1993-4. On uniformly antisymmetric functions. 
Real Anal. Exchange 19: 218-25. 

Kunen, K. 1980. Set Theory. Amsterdam: North-Holland. 

Levi, B. 1902. Intorno alia teoria clegli aggregati. Rendic., 2d ser. 35: 
863-8. 


Levy, A. 1979. Basic Set Theory. New York: Springer- Verlag. 

Mazurkiewicz, S. 1914. Sur un ensemble plan (in Polish). Comptes Rendus 
Sci. et Lettres de Varsovie 7: 382-3. French translation reprinted 
in Travaux de Topologie et ses Applications, 46-7. Warsaw: Polish 
Scientific Publishers PWN, 1969. 

Mirimanoff, D. 1917. Les antinomies de Russell et de Burali-Forti et le 
probleme fondamental de la theorie cles ensembles. L ’Enseignement 
Math. 19: 37-52. 

Royden, H. L. 1988. Real Analysis. 3d ed. New York: Macmillan. 

Russell, B. 1903. The Principles of Mathematics. Vol. 1. Cambridge: 
Cambridge University Press. 

Sierpinski, W. 1919. Sur un theorem equivalent a l’hypothese du continu. 
Bidl Int. Acad. Sci. Cracovie A: 1-3. Reprinted in Oeuvres Choisies, 
vol. 2, 272-4. Warsaw: Polish Scientific Publishers PWN, 1974. 

Sierpinski, W. 1920. Sur les rapports entre l’existence des integrates 
fo f( x iy)dx, f 0 f{x,y)dy et / 0 dx f* f(x,y)dy. Fund. Math. 1: 
142-7. Reprinted in Oeuvres Choisies, vol. 2, 341-5. Warsaw: Polish 
Scientific Publishers PWN, 1974. 



References 


227 


Sierpinski, W. 1936. Sur une fonction non measurable partout presque 
symetrique. Acta Litt. Scient (Szeged) 8: 1-6. Reprinted in Oeuvres 
Choisies, vol. 3, 277-81. Warsaw: Polish Scientific Publishers PWN, 
1974. 

Skolem, D. 1922. Einige Bemerkungen zur axiomatischen Begr undung der 
Mengenlehre. In Proceedings of the 5th Scandinavian Mathemati- 
cians’ Congress in Helsinki, 1922, 217-32. 

Steprans, J. 1993. A very discontinuous Borel function. J. Symbolic Logic 
58 : 1268-83. 

von Neumann, J. 1925. Zur Einfiihrung der transfiniten Zahlen. Acta Sci. 
Math. (Szeged) 1: 199-208. 

von Neumann, J. 1929. Eine Widerspruchsfreiheitsfrage in der axiomatis- 
chen Mengenlehre. Crelle J. 160 : 227. 

Zermelo, E. 1904. Beweis, claB jede Menge wohlgeordnet werden kann. 
Math. Ann. 59 : 514-16. 

Zermelo, E. 1908. Untersuchungen iiber die Grundlagen der Mengenlehre, 
I. Math. Ann. 65 : 261-81. 




Index 


Abelian group, 55 
absolute property, 172, 182 
AC, 15 

additive function, 111, 117 
Jones space of, 114 
Alexander subbase theorem, 58 
algebraic numbers, 73 
algebraically independent subset, 
60 

almost contained C*, 153 
almost-disjoint sets, 153 
antichain, 140, 174, 177 
maximal, 140 

antilexicographic-orcler relation, 43 
antisymmetric function, 119 
antisymmetric relation, 21 
arithmetic 

of cardinal numbers, 68 
of ordinal numbers, 48 
Peano axioms of, 26 
Aronszajn tree, 163 
axiom of choice, 4, 15, 18, 20, 52, 
54, 101 

axiom of empty set, 7 
axiom of extensionality, 6 
axiom of foundation, 37 
axiom of infinity, 10, 26, 182 
axiom of pairing, 8 
axiom of power set, 8 
axiom of regularity, 37 
axiom of set existence, 6 


axiom of union, 8 

axiom schema of comprehension, 

3 

axiom schema of separation, 7 

Baire category theorem, 33, 98 
Baire class-one function, 73 
Baire property, 98 
Baire-measurable functions, 104 
base, 58 

for R”, 89 
basis 

Hamel, 111, 115 
with Baire property, 117 
without Baire property, 113 
measurable, 117 
nonmeasurable, 113 
of a linear space, 55 
transcendental, 60 
Bernstein set, 103-5, 112, 117, 126 
bijection, 16 
binary relation, 12, 173 
^-measurable functions, 104 
Borel functions, 104 
Borel sets, 89, 97 
Borel cr-algebra Bor , 94 
bound variables in a formula, 166 
bounded quantifier, 168 
bounded subset of R”, 33 
branch of a tree, 163 


229 



230 


Index 


Cantor set, 71, 93, 111, 117 
Cantor-Bendixson theorem, 92 
Cantor’s theorem, 64 
cardinal number, 62 

collapsing of, 180, 185 
preservation of, 180, 188 
regular, 75, 144 
successor of, 65 
cardinal numbers 
arithmetic of, 68 
exponentiation of, 68 
product of, 68 
sum of, 68 
union of, 65 
cardinality 

continuum c, 66, 76, 195 
of a set, 61 

Cartesian product, 10, 20 
Cauchy sequence, 14, 33 
ccc forcing, 140, 149, 150, 179-81, 
195 

ccc iteration, 202, 206 
ccc linearly ordered set, 154, 162 
Ceder’s theorem, 89 
CH, 66 

chain in a partially ordered set, 
52 

characteristic function, 65 
choice function, 18 
Ciesielski-Larson theorem, 119, 121 
Ciesielski’s theorem, 124 
class 

equivalence, 14 
proper, of sets, 7, 44, 168 
quotient, 14 
of sets, 7 

closed set under the action of T ', 
85, 97 

closed subset of R", 32, 89 
closed unbounded subsets of uq, 
157 

diagonal intersection of, 161 


closure 

cl(5) of a set in R”, 33 
clyr(Z) of a set under the ac- 
tion of IF, 85 
transitive, trcl(a;), 165 
cofinality cf(a) of an ordinal num- 
ber, 74 

Cohen model, 182, 189 
Cohen real numbers, 182 
collapsing of cardinal numbers, 180, 
185 

compact subset of R™, 33 
compact topological space, 58 
comparable elements in a partially 
ordered set, 140 

compatible elements in a partially 
ordered set, 140 
compatible set, 147 
complete Boolean algebra, 140 
complete linearly ordered set, 135, 
154 

complete linear-order relation, 135, 
154 

complete set of axioms, 4 
composition of relations, 13 
comprehension, axiom schema of, 

3 

comprehension scheme, 7 
condition, 132 
ccc, 140 
stronger, 132 
conditions 

comparable, 140 
compatible, 140 
incompatible, 140 
connected relation, 23 
connected subset of R", 33, 114 
consistent set of axioms, 4 
consistent theory, 167 
consistent with ZFC, 66, 133, 141, 
155, 168 

constants in the language of set 
theory, 6, 168, 171, 174 



Index 


231 


continuous function, 33, 72, 107, 
111 

continuum c, 66, 76, 195 
continuum hypothesis, 66, 87, 89, 
105, 108, 111, 115, 122, 
124, 129, 136, 139, 157, 
168, 183 

generalized, 66, 107 
countable chain condition ccc, 140 
countable set, 64 
countably closed forcing, 183 
cover, 58 

Darboux function, 106 
decreasing function, 41 
strictly, 41, 51 
Dedekind cut, 31 
definitions by recursion, 49 
definitions by transfinite induction, 
49 

Ao-formula, 172 

A-system lemma, 150, 151, 153, 
154, 181, 206 
dense set, weakly, 139 
dense set below p, 183 
dense subset of a linearly ordered 
set, 133, 135, 154 
dense subset of a partially ordered 
set, 130, 132, 141, 174 
dense subset of R™, 33, 79, 99, 
117 

diagonal intersection of closed and 
unbounded sets, 162 
diagonalization, 81, 129, 198 
diamond principle <C>, 158, 182 
<C>-sequence, 158 
difference of sets, 8 
symmetric, 9 

difference-free partition, 124 
discrete space, 94 
discrete subset of R", 89 
distance in R™, 32, 86 
domain of a function, 16 


domain of a relation, 12, 173 

empty set axiom, 7 
equivalence class, 14 
equivalence relation, 14, 100 
Erdos-Hajnal theorem, 86, 115 
Erdos-Kakutani theorem, 115 
Euclidean space R™, 32 
exponentiation of cardinal num- 
bers, 68 
extension 

of a function, 18 
generic, 170 
extensionality axiom, 6 

field, 55 

of a relation, 12 
filter, 60, 130 

iF-generic, 132 
generic, 169 
prime, 60 
principal, 131 
proper, 60 
finite set, 64 
first element, 22 
first-category set, 98, 144 
Fodor’s theorem, 162 
forcing, 132 

cardinal-collapsing, 180 
cardinal-preserving, 180, 188 
countably closed, 183 
iterated, 189 
forcing method, 164 
forcing principle, 169 
forcing relation ||— , 177 
forest in a graph, 60 
formal proof, 167 
formula, 6, 166 

a consequence of a theory, 167 
A 0 -, 172 

||— p representing forcing re- 
lation, 177 
relativization of, 168 



232 


Index 


free variable, 166 
F a set, 94 

Fubini-Tonelli theorem, 105 
function, 16 

GCH, 66 
Gg set, 94, 99 

generalized continuum hypothesis, 
66, 107 

generic extension of a model, 170 
generic filter, 132, 169 
Godel’s incompleteness theorem 
first, 5, 167 
second, 4, 167 
good P-name, 199 
graph on a set, 60 
bipartite, 116 
forest in, 60 

greatest element, 22, 39 
ground model, 170 
group, 55 

Abelian, 55 

Hahn-Banach theorem, 54, 56 
Hamel basis, 111, 115 

with Baire property, 117 
without Baire property, 113 
measurable, 117 
nonmeasurable, 113 
Hausdorff maximal principle, 54, 
140 

ideal, 14, 98 

of nowhere-dense sets AfV, 98 
image, 16 

incompatible elements in a par- 
tially ordered set, 140 
inconsistent theory, 167 
increasing function, 41 

strictly, 41, 47, 74, 138, 139 
independence 
algebraic, 60 
of the axioms, 5 


linear, 55, 113, 123 
from ZFC, 5, 66, 110, 164, 
166, 168, 183 
index set, 19 
indexed family, 19 
induction 

on natural numbers, 27 
too short, 130 
transfinite, 40, 198 
definitions by, 49 
infinite set, 64 
infinity axiom, 10, 26, 182 
initial ordinal number, 62 
initial segment, 31, 39 
proper, 40, 41 
injective function, 16 
integers, 30 
integral, 105 

interior int(S') of a set, 32 
intermediate value theorem, 106 
intersection of sets, 8, 19 
diagonal, 162 
inverse relation, 13 
isolated point, 89 
isomorphism of ordered sets, 39, 
41, 47, 134, 135, 185 
iterated forcing, 189 
iterated integral, 105 
iteration of ccc forcings, 202, 206 
iteration of partially ordered sets, 
200, 204 ' 

Jones space, 114 

Kirchheim-Natkaniec theorem, 108 
Kuratowski’s theorem, 126 

language of set theory, 4, 6, 166 
largest element, 22 
last element, 22 
least element, 22 
least upper bound, 135 
Lebesgue measure, 101 



Index 


233 


Lebesgue measure-zero set, 98, 117, 
145, 193, 194 

Lebesgue-measurable functions, 104 
Lebesgue-measurable Hamel ba- 
sis, 117 

Lebesgue-measurable sets, 98 
lexicographic-order relation, 43 
limit points F' of F C R", 92 
linear function, 111 
linear functional, 56 
linear space, 55 
basis of, 55 

linear subspace, 55, 111 
linearly independent subset, 55, 
113, 123 

linearly ordered set, 23, 39, 42 
linear-order relation, 23, 60, 139, 
156 

lower bound, 52 

MA, 141 
map, 16 

cofinal, 74 

Martin’s axiom MA, 141, 155, 195, 
196 

maximal antichain, 140 
maximal element, 22, 39, 53, 54 
Mazurkiewicz’s theorem, 81 
meager set, 98, 144 
minimal element, 22, 39 
model 

Cohen, 182, 189 

ground, 170 

for theory T, 168 

for ZFC, 169 

for ZFC+“c = w 2 ,” 195 

for ZFC+0, 182 

for ZFC+MA+-.CH, 196 

for ZFC+- 'CH, 168 

for ZFC+-MA, 195 

for ZFC+“2 W1 = u > 2 ,” 188 


natural numbers, 4, 25 
non-Baire functions, 104 
non-Borel functions, 104 
nonmeasurable functions, 104 
nowhere-constant function, 108 
nowhere-dense set, 33, 98, 110, 
117 

nowhere-symmetrically continuous 
function, 118 
null set, 98, 145, 193, 194 

one-to-one function, 16 
onto function, 16 
open ball B(p,e) in M™, 32 
open cover, 58 
open set in R n , 32, 89 
order isomorphism, 39, 41, 47, 134, 
135, 185 
order relation 

antilexicographic-, 43 
complete, 135, 154 
lexicographic-, 43 
linear-, 23, 60, 139, 156 
partial-, 21, 60, 174 
order topology, 154, 157 
order type 
77, 135 
A, 135 
u*, 51 

Otp(lF) of a well-ordered set, 
47 

of a partially ordered set, 38 
ordered n-tuple, 10 
ordered pair, 9, 173 
order-isomorphic partially ordered 
sets, 38 

ordinal number, 44, 90, 174, 196 
cofinality of, 74 
initial, 62 
limit, 46, 174 
successor of, 46, 174 



234 


Index 


ordinal numbers 

arithmetic of, 48 
product of, 48 
sum of, 48 

pairing axiom, 8 

partial functions Func(X, Y) from 
X into Y, 131 
partially ordered set, 22 
partially ordered sets 
iteration of, 200, 204 
order isomorphic, 38 
product of, 149, 150 
partial-order relation, 21, 60, 174 
partition, 14, 86, 115, 119 
difference-free, 124 
sum- free, 124 

Peano axioms of arithmetic, 26 
perfect subset of R ra , 89, 90, 101, 
102 

P-name, 175, 177 
good, 199 
valuation of, 175 
poset, 22 
power set, 8 
power set axiom, 8 
preimage, 16 
preorder relation, 24, 139 
preservation, in induction proofs, 
81 

preservation of cardinal numbers, 
180, 188 

preservation of properties in generic 
extensions, 172 
pressing-down lemma, 162 
principle of mathematical induc- 
tion, 26 

principle of transfinite induction, 
40 

product 

of cardinal numbers, 68 
Cartesian, 10, 20 
of ordinal numbers, 48 


of partially ordered sets, 149, 
150 

Tychonoff, 58 
product lemma, 190 
proper initial segment, 40, 41 

quantifier, 6, 166 
bounded, 168 
unbounded, 168 
quotient class, 14 

range of a function, 16 
range of a relation, 12, 174 
rank of a set, 166 
Rasiowa-Sikorski lemma, 132, 139, 
169 

rational numbers, 30 
real function, 104 
real numbers, 25, 31 
Cohen, 182 

recursion definitions, 49 
recursion theorem, 49 
reflexive relation, 13 
regressive function, 162 
regular cardinal numbers, 75, 144 
relation 

antisymmetric, 21 
binary, 12, 173 
connected, 23 
domain of, 12, 173 
equivalence, 14, 100 
forcing ||— , 177 
||-*, 177 
inverse, 13 
order 

antilexicographic-, 43 
complete, 135, 154 
lexicographic-, 43 
linear-, 23, 60, 139, 156 
partial-, 21, 60, 174 
preorder, 24, 139 
range of, 12, 174 
reflexive, 13 



Index 


235 


restriction of, 23 
=*, 139 
<*, 136 
<*, 139 
symmetric, 13 
transitive, 13 
well-founded, 37, 163 
well-ordering, 38, 90 
relations 

composition of, 13 
in the language of set theory, 
168 

replacement scheme axiom, 17 
restriction of a function, 18 
restriction of an order relation, 23 
Russell’s paradox, 3, 7 

scale, 136, 141 

Schroder-Bernstein theorem, 64 
selector, 15, 117 
sentence, 167 

a consequence of a theory, 167 
consistent with a theory, 167 
independent of a theory, 5, 
167 

sequence, 18 

Cauchy, 14, 33 
0 -, 158 
transfinite, 48 
set-theoretic universe, 7 
Sierpinski -Zygmund theorem, 111 
Sierpinski’s theorem, 88, 105 
a- algebra, 93 

of Borel sets Bor, 94 
of Lebesgue-measurable sets 
C, 101 

of sets Baire with the Baire 
property, 101 
er-ideal, 98 

of first-category sets, 98, 144 
of Lebesgue measure-zero sets, 
98, 145, 193, 194 
of meager sets, 98, 144 


of null sets, 98, 145, 193, 194 
smallest element, 22, 39 
stationary set, 158, 162 
strong measure-zero set, 153 
strongly Darboux function, 106 
subbase of a topological space, 58 
subcover, 58 
successor, 10, 26 

of a cardinal number, 65 
of an ordinal number, 46, 174 
sum of cardinal numbers, 68 
sum of ordinal numbers, 48 
sum-free partition, 124 
surjective function, 16 
Suslin hypothesis SH, 155 
Suslin line, 155 
Suslin tree, 163 
symmetric difference, 9 
symmetric relation, 13 
symmetrically continuous function, 
118 

nowhere, 118 

theory, 167 

consistent, 167 
inconsistent, 167 
topological properties of R”, 32 
topological space, 58 
base of, 58 

Borel structure of, 93 
compact, 58 
discrete, 94 
subbase of, 58 
topology, 58 

order, 154, 157 
on R", 32 

transcendental basis, 60 
transfinite induction, 40, 198 
definitions by, 49 
transfinite sequence, 48 
transformation, 16 
transitive closure trcl(a;) of a set, 
165 



236 


Index 


transitive model of ZFC, 169 
transitive relation, 13 
transitive set, 164, 174 
tree, 163 

Aronszajn, 163 
branch of, 163 
Suslin, 163 

tree construction, 92, 102 
Tychonoff product, 58 
Tychonoff theorem, 54, 58 

unbounded quantifier, 168 
unbounded subset in an ordinal 
number, 74 
uncountable set, 64 
uniformly antisymmetric function, 
119 

union axiom, 8 

union of cardinal numbers, 65 
union of sets, 8 
upper bound, 52 
least, 135 


valuation, 175 
variable, 6, 166 
bound, 166 
free, 166 
vector space, 55 
Vitali set, 117, 126 

weakly dense set, 139 
well-founded relation, 37, 163 
well-ordered set, 38, 39, 42, 47, 51 
type of, 47 

well-ordering relation, 38, 90 
well-ordering theorem, 51 

Zermelo-Fraenkel axioms, 4 
Zermelo’s theorem, 51 
ZF, 52 
ZFC, 4 

Zorn’s lemma, 53, 54 



